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ABOUT THE COMPANION WEBSITE: 

www.oup.com/us/ohss 


Since this Oxford Handbook deais with sound and a book is an overwhelmingly 
visual médium, we have taken advantage of the Web to post sound samples of some 
of the sonic material considered by contributors. At the companion website, authors 
of many of the individual chapters have posted pertinent examples that can be lis- 
tened to by following the links at the website. Occasionally there are visuals, such as 
videos, which accompany the sound samples; one of the major themes of our hand- 
book is that the visual and sonic often work together. Enjoy what you hear (and see) 
and, if you find more relevant examples, do not hesitate to let us know: the beauty 
of the Web is that material can be continually updated there. 

Trevor Pinch and Karin Bijsterveld 
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KARIN BIJSTERVELD 


i. Introduction 


We were sitting in the Tudor Arms, Stockholm, recen tly voted as “the best British 
pub in the world” — outside of Great Britain. With oak-paneled walls, wooden 
beams, nooks and crannies, warm beer, and special British culinary delights such as 
steak and mushroom pie, the Tudor Arms is a perfect emulation of a pub. We were 
with two friends, Thomas and Otto, one Swedish, one Germán. The discussion was 
getting heated; the topic was the SoundEar. 

The SoundEar is a special device found in many kindergartens in Scandinavia. 
It is a box that displays an outline ear with green glowing LEDs that can be attached 
to a classroom wall. The SoundEar measures sound. 1 When the noise in the class- 
room is at or below a predetermined level, the ear glows green. When the noise 
reaches 10 decibels above this level, the inner part of the ear glows amber; if the 
noise continúes at this level or above for longer than ten seconds, the ear will glow 
red. The kids have been warned: Be quiet! 

Otto had encountered this device when his child first entered Swedish kinder- 
garten. He had noticed that the kids had shut up as soon as the ear turned amber. 
For Otto, who was used to rowdy Germán kids and their teachers’ use of more tra- 
ditional means of keeping order, it was a telling example of the difference between 
Swedish and Germán society. In public places the Swedes liked to be “civilized.” 
Even on the train home to the suburbs of Stockholm, where he lived, he had 
observed older school kids monitoring the behavior of younger kids and telling 
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them to pipe down if they were too loud. Thomas quietly asserted that Otto was 
missing the point. The SoundEar was a safety device that had been introduced into 
all Swedish schools by the caring State to protect the hearing of school employees. 
This attention to noise in the workplace was actually a sign of Swedish progressive - 
ness. Of course, it might be a means of discipline, but Thomas pointed out that this 
is what hard hats at a building site do and surely Otto wouldn’t want more head 
injuries. Otto wasn’t buying this argument, however. He claimed that the kids at 
first took no notice of the SoundEar and that their natural inclination was just to 
talk louder as their excitement rose. Thomas conceded that the SoundEar some- 
times didn’t work properly or was switched off in the realization that teachers 
sometimes just couldn’t stop the kids from being kids. As Otto and Thomas argued, 
we realized that the pub had become almost silent; everyone was looking our way. 
The Tudor Arms was a perfect emulation of a British pub except for one thing. In 
Britain the noise gets louder as the evening wears on and more and more beer is 
consumed. In Sweden that crescendo never comes! 

We wondered, was this the future of pubs? Once the pub would have been full 
of cigarette smoke; now pubs everywhere (except for Berlín, as Otto proudly pointed 
out) are smoke free. Will new laws and policies eventually be enacted to control 
sound in public spaces? Will one day machines like the SoundEar glow everywhere? 
And the SoundEar is starting to glow in other places. The largest market for sales is 
hospitals in the United States. Patients who are coming out of anesthesia are espe- 
cially sensitive to sound. Furthermore, as Hillel Schwartz points out in this volume, 
hospitals, which were once islands of silence, are getting noisier and noisier, and 
one of the major contributors to the increased cacophony are technologies, whether 
the humming ofventilators or the beeping of monitoring equipment. The SoundEar 
helps maintain a quiet environment for certain patients. Researchers are also explor- 
ing the use of the SoundEar during surgery: It has been claimed that a percentage of 
surgical error arises from unwanted noise in the operating room. 

This true story 2 of our encounter with the SoundEar captures some of the main 
themes in this book. Modernity has brought about developments in Science, tech- 
nology, and medicine and at the same time increasingly new ways of producing, 
storing, and reproducing sound. Sound is no longer just sound; it has become 
technologically produced and mediated sound. This allows it to be more easily 
transformed, or “transduced,” the term used by some of the authors in this book. 
Transduction turns sound into something accessible to other senses. The SoundEar 
turns sound into sight. Of course, we should not forget (as Jonathan Sterne and 
Mitchell Akiyama point out in their chapter), that the human ear itself is a trans- 
ducer, turning sound into vibrations that then become nerve impulses that are 
registered in the brain. Yet now that technologies of transduction are everywhere, 
we would like to foreground their appropriation and consequences in Science, 
society, and culture as important topics for study. 

Sound is no longer produced only by humans and nature, for machines roar 
everywhere and technologies not only measure sound in a myriad of new ways but 
also produce and emulate sounds, such as in video games and movies. New sources 
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of sound such as the ubiquitous iPod and cell phones demónstrate that sounds have 
become personal and mobile. New sounds — never heard before, such as the sound 
of industrialization, the gasoline engine of the automobile, and electronic sounds — 
have entered what Murray Schafer has so felicitously called “the soundscape” 
(Schafer 1994/1977). 

Sounds can be captured in new ways, such as with the parabolic microphone 
used to record bird sound, and stored in new ways, such as with the first such 
device, the recently rediscovered phonoautograph, and the better-known phono- 
graph, which is used not only for music but also as a new scientific tool for the 
Science of ethnology. Sounds that could not be heard before can now be heard for 
the first time, such as the sound of atoms, which lies at the heart of the newly 
invented scanning probe microscope, one of the key instruments of the new field of 
nanotechnology. Digital technologies today provide ever-new ways of storing, 
manipulating, and transferring sound and music. The recording of music has been 
transformed (and is still transforming) what it means to make and experience 
music. Sounds can be reproduced in new sorts of places, such as under water. 
The establishment of a whole new Science of sound — acoustics — has led to new 
instruments, new ways of measuring, conceptualizing, and controlling sound, 
and new contributions to music. With the development of the new field of acousti- 
cal architecture, buildings and rooms can be specially designed for their sound 
qualities (Thompson 2002). We interact with sound in new ways. Scientists turn to 
sonification — innovative technologies and techniques for rendering scientific data 
into sound. 

However, new ways of interacting with sound are also part of everyday life. 
Consider, for instance, the humble radio dial, which for many people at the dawn 
of radio opened up a global world accessible only by sound. One of the more 
significant advances in the history of medicine has come through sound. The ability 
to listen to the body via the stethoscope provides doctors a powerful new means 
of diagnosis (Lachmund 1994, 1999). In modern terms, the stethoscope enables 
them to do an “audification” of bodily phenomena. Sound continúes to transform 
medicine through technologies such as ultrasound (Casper 1998). Novel forms 
of hearing aid that directly stimulate the brain mean that deaf people can almost 
miraculously start to hear again. 

The consequences of all of this are vast. Sound becomes more materially 
mediated in a whole host of newfangled ways. Sound becomes more “thinglike” — 
a commodity to be bought and sold on iTunes, a thing to be worn, as with personal 
stereos. Sound becomes a means in itself to sell and market goods. Sound can not 
only be listened to but also be measured, regulated, and controlled (Thompson 
2002; Bijsterveld 2008). It can even become an important part of political dissent as 
with subversive listening to the radio and making of tapes in former Communist 
countries, which Trevor Hagen and Tia DeNora discuss in this volume. 

For us, Science, technology, and medicine are the keys to unlock these new 
worlds of sound. Science, technology, and medicine do not only — intentionally or 
unintentionally — create novel sources of sound but also provide us with innovative 
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tools for using sound and theories about it. Which sounds have been produced, 
captured, stored, and transferred by Science, technology, and medicine? By which 
means? How have society and culture appropriated these sounds and means, and 
how have scientists, engineers, and doctors themselves listened to the objects, 
machines, and bodies they study — with or without the help of sonic equipment? 

These questions have informed our decisión to organize the twenty-three chap- 
ters that make up this handbook around the different sorts of places where sound is 
experienced. Some of these are the exemplary sites of Science, technology, and med- 
icine, such as the laboratory, the test site, the design studio, and the clinic. Such sites 
are populated by experts, test subjects, and carefully monitored patients and objects. 
The other sites we have chosen — the workshop, the field, the home — are much 
more open and accessible to nonexperts. By stressing this we do not mean to say 
that “society and culture” are not to be found in labs and clinics, unlike in fields and 
homes, but that different sites have different gatekeepers and that this affects how 
sounds are listened to, talked about, and given meaning. In the earlier sections these 
sites tend to be well-defined circumscribed environments, but in the later sections, 
particularly sections VI and VII, we encounter more diffuse environments. Section 
I deais with how machine sounds are encountered on the shop floor and at the test 
site. Section II covers what we cali the field, including the ocean, and Section III 
covers the laboratory. Section IV takes the clinic as its location, and Section V the 
design studio. Section VI focuses mainly upon the home but also ineludes karaoke 
bars, clubs, concert halls, and Internet cafés. Section VII is the most diffuse in terms 
of location as it focuses upon digital storage of sound, and, of course, as such devices 
become ever smaller they can be played back in a myriad of places. In addition, as 
the tifies of our sections suggest, we explore how at these sites sounds have been 
reworked (literally and metaphorically), staged, spoken for, edited, consumed, and 
moved from one place to another. We stress, however, that this ordering does not 
preelude alternative routes through our volume. You can work through the chap- 
ters in order or dip in and out on your own self-directed journeys through sound. 


2. Now That Sound Is in the Air 


Our focus on Science, technology, and medicine as the keys to unlock the worlds of 
sound unsurprisingly helped to attract authors with a background in the history 
and sociology of technology and Science, as well as in Science and technology stud- 
ies (STS). Nonetheless, we could not have made this book without contributions 
from scholars who are working on the history of the senses, cultural history, anthro- 
pology of medicine and the body, media studies, film studies, game studies, and 
musicology. 

At the same time, we do not claim that this book fully covers “sound studies.” 
In 2004 we defined sound studies as “an emerging interdisciplinary area that studies 
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the material production and consumption of music, sound, noise, and silence and 
how these have changed throughout history and within different societies” (Pinch 
and Bijsterveld 2004: 636). Although we still endorse this definition, the word 
emerging may now be slightly too modest. Sound studies has become a vibrant new 
interdisciplinary field with many different, yet often overlapping, strands. Among 
the areas involved are acoustic ecology, sound and soundscape design, anthropol- 
ogy of the senses, history of everyday life, environmental history, cultural geogra- 
phy, urban studies, auditory culture, art studies, musicology, ethnomusicology, 
literary studies, and STS. New fields of study always come with competing defini- 
tions of what should be studied, and sound studies is no exception. Each strand 
conceptualizes its topic and thus the reality it constructs as its proper subject in 
slightly different ways. 

Acoustic ecology has its roots in the environmental concerns of the late 
1960S and the 1970S. In those years Raymond Murray Schafer, Barry Truax, and 
Hildegard Westerkamp established their World Soundscape Project at Simón Fraser 
University, Vancouver. Their approach was and still is a highly original mix of rais- 
ing environmental awareness of “our sonic environment and the tuning of the 
world,” mapping past and present soundscapes, endorsing psychoacoustic research 
that reaches beyond the individual, and contributing to a higher-quality sonic 
environment through composition and sound design (Schafer 1967, 1969, 1977, 
1994/1 977; Truax 1978, 2001; Westerkamp 1974). Their allies and heirs are to 
be found among today’s innumerable projects that record and play the soundscapes 
of contemporary cities or intervene in such soundscapes by adding sound art. 
In addition, their work has been cited and paraphrased so often that a notion 
like soundscape as sonic environment, area of aural study, or composition is 
now the solé focus of publications that trace its roots, discuss its merits and short- 
comings, and elabórate on alternative definitions (for an excellent overview see 
Kelman 2010). 

Closely related to acoustic ecology are the emerging fields of soundscape design 
and sound design. While soundscape design is dominated by architects and sound 
artists (Arkette 2004; Augoyard and Torgue 2005; Martin 1994; LaBelle and Roden 
1999), sound design is largely the domain of engineers, designers, and marketers 
(Langemaier 1993; Ózcan Vieira 2008). Such sound designers measure the sound- 
level emission of a consumer good, attempt to curtail or silence its noise, add spe- 
cifically designed sound to it, study the perception of these sounds, and design the 
sound of media producís such as films or video games (Collins 2008; Grimshaw and 
Whittington, this volume). Acoustic ecology and soundscape and sound design 
together have been the feeder disciplines for imaginative interdisciplinary sound 
studies programs. The sound studies research and education activities coordinated 
by Berlín cultural theorist Holger Schulze are an example of this new alignment. 
In his work the theory and history of auditory culture, which draws on acoustic 
ecology and historical anthropology, are combined with experimental sound 
design — including sound art — sonic branding, and audio production. It explicitly 
incorporates a “new materialism” that studies sound as both technical-physical 
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emanation and artistic-aesthetical imagination, and as a “tangible and rich subject 
of our experience, feelings and thoughts” (Schulze 2008, 11; Spehr 2009). This strand 
of research thus overlaps with more positivistic approaches to sound studies. Sonic 
interaction design, for instance, focuses on the “exploitation of sound as one of the 
principal channels conveying information, meaning, and aesthetic/emotional qual- 
ities in interactive contexts.” It combines an interest in interaction design with the 
auditory display of information, “sound modeling,” sound perception, cognition 
and emotion, and sound and music computing. 3 

Without the interactive media available today, such initiatives would not have 
been possible, and it is therefore not surprising that the field of media studies has 
made a significant contribution to sound studies. Yet, while sonic interaction design 
starts out in perception psychology and the communication Sciences, most of this 
work in media studies is firmly rooted in the humanities, qualitative sociology, and 
cultural studies. This is true for the more historically oriented works on sound and 
media technologies such as the phonograph, gramophone, telephone, tape recorder, 
and radio (Badenoch 2008; Douglas 1999; Fickers 1998; Fischer 1992; Gitelman 1999, 
2003; Haring 2008; Katz 2004; Morton 2000; Schiffer 1991; Weber 2008). It is also 
true for the contributions by sociologists and media theorists on the meaning and 
use of both these older and the more contemporary sound, media, and communi- 
cation technologies such as the Walkman, iPod, cell phone, and film (Agar 2003; 
Alderman 2001; Birdsall and Enns 2008; Bull 2000, 2007; Goggin 2006; Lastra 2000; 
Sterne 2003; Wurtzler 2007). Indeed, several of these authors, who often combine 
an interest in media studies with the history of technology, are included in the pres- 
ent volume. 

The World Soundscape Project has been rightfully credited for coining the 
word soundscape and putting sound studies on the map. One should not forget, 
however, that without the silent assistance of long-standing scholarly traditions 
such as the anthropology of the senses, the Annales school of history, cultural his- 
tory, environmental history, and musicology, the study of sound and soundscapes 
might have had a much more problematic reception in the humanities (see also 
Smith 2010). The studies of the smells and sounds of everyday French Ufe by Annales 
historian Alain Corbin not only brought the most intangible aspects of our past to 
the attention of historians and others but also contributed to the notion that modes 
of listening and other sensory experiences are as historically variable as many other 
aspects of life (Corbin 1986/1982, 1995, 1999). Cultural histories of reading, speech, 
and sound sensitized our knowledge about who was allowed (or not allowed) to 
speak or to make sounds and when and where this occurred (Bailey 1996; Burke 
1993; Cockayne 2007; Johnson 1995; Payer 2006; Picker 2003; Rath 2003; B. Smith 
1999; M. Smith 2000, 2001, 2002, 2003, 2004; Rée 1999). Environmental historians 
did a similar yet more governance-oriented job by tracking down the rise of noise 
abatement regulation (Coates 2005; Smilor 1980; Saúl 1996a, 1996b), while cultural 
geography and urban studies have informed us about the spatial distribution of 
noise and its political implications (Rodaway 1994; Revill 2000). Studies by anthro- 
pologists and historians on the hierarchy, meaning, and employment of the senses 
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in different societies substantiated arguments for cultural differences in sensory 
experience (Classen 1997; Parr 2001; Stoller 1989). With a journal ( The Senses and 
Society ) and introductory volumes (Howes 2005; Jütte 2005; M. Smith 2007), sen- 
sory studies is now an established field in its own right. These introductory volumes 
inelude studies of “auditory culture” (Bull and Back 2003) and “hearing cultures” 
(Erlmann 2004) and feed into museum exhibits on the history and ethnography of 
sound and noise (Gonseth 2010; Smith, this volume). 

The fields of art studies, musicology, and ethnomusicology have recen tly wid- 
ened their scope to inelude some sound studies. This probably refleets the shifting 
interest among students from classical music to popular music and from the skills 
necessary for playing traditional musical instruments to those involved in electronic 
and digital recording, as well as sampling, composing, and consuming music 
(DeNora 2000; Kahn 1992, 1999; Kraft 1996; Kelly 2009; Lysloff and Gay 2003; 
Perlman 2004; Greene and Porcello 2005; Théberge 1997; Schmidt-Horning 
2004; Stadler 2010; Chanan 1995; Taylor 2001). Perhaps as a sign of the times, the 
Department of Music and Center for Ethnomusicology at Columbia University in 
2009 organized a conference, “Listening In/Feeding Back: Listening and the 
Circulation of Sound Media,” which brought together scholars from sound studies 
and the more traditional fields of musicology and ethnomusicology. Moreover, the 
rise of recording has stirred a subspecialty within musicology that has been coined 
phonomusicology: the study of recorded music (Cottrell 2010, 15; Philip 2004; Day 
2000). 

Cióse neighbors of ethnomusicology and phonomusicology are new musicol- 
ogy and radical musicology, which have fully digested cultural studies approaches 
(Rodgers 2010). Often such work also refers to more literary approaches to sound. 
Literary studies examine the ways that literary writers textualize sound, hearing, 
and listening, express the sounds of their times, imbue sound with meaning, and 
evoke noise (for an overview see Bernhart 2008). Some of their publications center 
on textualized sound in experimental poetics, poetry performances, and poetry 
(Morris 1997). Others analyze references to specific forms of hearing and listening 
in literary texts, such as eavesdropping (Gaylin 2002) or mishearing (Connor 
2009) — although Connor’s humorous talk on mishearing reaches far beyond 
literature. Highly insightful are the studies that unravel expressions of sound in 
Shakespearean theater (Folkerth 2002) and in twentieth-century plays (Meszaros 
2005), as well as textualizations of noise in late nineteenth- and twentieth-century 
literary texts. Michael Cowan (2006), for instance, has written wonderfully on 
Rainer Maria Rilke’s resistance to worldly noise. and Philipp Schweighauser (2006) 
intelligently unravels the noises of naturalist, modernist, and postmodernist 
American literature in his “history of literary acoustics.” 

Textualization of sound is again a transformation of one sensory experience 
into another. This and the rise of sensory studies might lead to the question whether 
one sense — hearing — deserves its own field and indeed its own handbook, but this 
would be to misconstrue the senses involved in sound studies. Although sensory 
studies in general has contributed to the rise of sound studies and sound studies is 



10 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


clearly part of sensory studies, we feel that sound studies, especially when taken in 
the direction of Science, technology, and medicine, raises a far wider range of issues. 
We have already pointed out that as sound becomes more materially embedded, 
it becomes mediated by many senses, including seeing and touch. It is exactly this 
relationship between the material embedding and multisensory mediation of 
modern sound that intrigues us and has led to our key questions. 

Sound studies thus is a flourishing interdisciplinary area with several overlap- 
ping disciplines and a range of methods that touch upon the fields of acoustic ecol- 
ogy, sound design, urban studies, cultural geography, media and communication 
studies, cultural studies, the history and anthropology of the senses, the history and 
sociology of music, and literary studies. We are not the first authors to tie in some 
of these wider concerns with the specific fields of history and sociology of Science 
and technology and STS. Hans-Joachim Braun, a historian of technology, has long 
been an advócate of studying music and sound technologies within the context of 
the history of technology and has organized many sessions on such themes at inter- 
national meetings of the history of technology. This development led to Braun’s 
important edited collection, “I Sing the Body Electric \ # Music and Technology in the 
2 oth Century (Braun 2000, 2002). Emily Thompson, another scholar whose roots lie 
in the history of technology, has shown in her authoritative book, The Soundscape 
ofModernity (Thompson 2002), how the Science and practice of acoustical architec- 
ture developed in parallel with twentieth-century concerns and changes in technol- 
ogy and how sound was experienced. Jonathan Sterne, a scholar from media studies, 
draws upon STS in his important book, The Audible Past , which deais with how, in 
the twentieth century, listening changed in concert with technologies such as the 
telegraph and the stethoscope (Sterne 2003). 

Our own interest in sound studies should also be mentioned here. One of us, 
Trevor Pinch, stumbled into the field in the course of writing his book on the his- 
tory of the Moog electronic music synthesizer (Pinch and Troceo 2002). Pinch, 
a professor of Science and technology studies who has a background in the sociol- 
ogy of Science and technology, was initially interested in linking STS to music and 
held sessions called a STS Faces the Music” at an international Science studies con- 
ference at Bielefeld, Germany, in 1996, in which Karin Bijsterveld also participated. 
Eventually Pinch teamed up with Bijsterveld to pursue this theme in research on 
musical instruments (Pinch and Bijsterveld 2003) and with a special issue of 
Social Studies of Science : “Sound Studies: New Technologies and Music,” published 
in 2004. Pinch first taught a course in sound studies at Cornell University in 
2004, while Bijsterveld started a course called Sound Technologies and Cultural 
Practices at Maastricht University the same year. She has a background in history, 
musicology, and STS and has contributed to the history of noise with Mechanical 
Sound: Technology , Culture , and Public Problems of Noise in the Twentieth Century 
(Bijsterveld 2008), our knowledge of the everyday use of audio technologies with 
Sound Souvenirs: Audio Technologies y Memory , and Cultural Practices (Bijsterveld 
and Van Dijck 2009), and our understanding of acoustic cocooning in the car 
(Bijsterveld 2010). 
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3. Listening for Knowledge 


The new audio technologies discussed in this book, such as the phonograph, gram- 
ophone, radio, cassette player, and digital audio equipment, have not only drawn 
on earlier instruments used in Science, engineering, and medicine but also contrib- 
uted to the processes of knowledge creation in these fields (Brady 1999; Kursell 
2008; Stangl 2000). Such contributions have never been self-evident, however. 

If there is any field that is associated with seeing rather than with hearing, it is 
Science. Scholars who emphasize the visual bias in Western culture even point to 
Science as their favorite example. Because doing research seems impossible without 
using images, graphs, and diagrams, Science is — in their view — a visual endeavor 
par excellence. Historians and sociologists of Science have recently corrected this 
claim by showing how senses other than seeing, including listening, have been 
significant in the development of knowledge, notably in the laboratory. They stress 
that scientific work involves more than visual observation. The introduction of 
measurement devices that merely seem to require the reading of results and thus 
seeing has not ruled out the deployment of the scientists’ other senses. On the con- 
trary, scientific work in experimental settings often calis for bodily skills, one of 
which is listening. We’ll come back to these views later. The world of Science 
itself, however, still considers listening a less objective entrance into knowledge 
production than seeing. Why? 

One of the aims of our book is to offer readers a better understanding of this 
contested position of sonic skills — by which we mean listening skills and the skills 
needed to employ the tools for listening — in knowledge production. Several con- 
tributors to this handbook focus specifically on the role of sound and listening in 
Science, technology, and medicine. This ineludes fields that have acoustic phenom- 
ena as their exclusive topic and domains that examine other subjeets with the help 
of sound. Most of these contributions focus on the period from the 1920S on, thus 
taking the rise of recording technologies into account. Our readers can find these 
contributions in the first four sections of our book, which focus on listening on 
shop floors and at test sites, in the field, in labs, and in clinics, respectively. While 
Stefan Krebs, Eefje Cleophas, and Karin Bijsterveld unravel the listening practices 
of engineers and mechanics, Joeri Bruynincks, Julia Kursell, Myles Jackson, Cyrus 
Mody, and Alexandra Supper discuss those of scientists, and Tom Rice and Mara 
Mills those of physicians. Moreover, Stefan Helmreich and some of the authors just 
mentioned also show what artists take from scientists when dealing with sound. 

Three questions in particular are of concern here. 4 The first is how scientists, 
engineers, and physicians have employed their ears in making sense of what they 
have studied. How did they listen to the objeets, machines, and bodies they studied? 
What tools did they use? And how did they acquire their sonic skills? The second 
question is how such listening practices, regardless of whether or not mediated by 
recording and amplification technologies, have generated new scientific knowledge, 
technological designs, and medical devices. In short, did listening elicit new 
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questions and findings in these fields, and if so, what kind of listening? The final and 
most crucial question is why listening has nonetheless remained contested and still 
lacks the same legitimization given to other means of knowing. In other words, 
under which conditions have sonic skills been accepted as an “objective” means of 
inquiry alongside visual ones in Science, engineering, and medicine? In some cases, 
why have visual skills partially replaced sonic skills? Why, for instance, is “sonifica- 
tion,” that is, the auditory (in contrast to the visual) display of scientific data still 
deeply contested today? 

These questions have almost naturally evolved from a particular set of issues 
predominating the historiography of the senses and STS for quite some time. To 
many scholars, as we have already suggested, Science is the most telling example 
of the visual orientation of Western culture (Attali 1985; Berendt 1985). To the 
indignation of such scholars, even acousticians illustrate their work with slides and 
charts rather than with sounds (Schafer 1994/1977). Others claim that the numerous 
visual metaphors for knowing in Western languages (“Yes, I see”) proves the 
visual bias of Western culture (Tyler 1984). Comparisons with cultures that have 
epistemologies based on alternative sensory orientations, such as auditory ones, 
rhetorically underline this argument (Feld 2003). 

For many years, the discussion boiled down to the issue of when , exactly, 
Western culture had turned from privileging hearing to preferring sight. Did it 
occur with the rise of print (Bailey 1996)? Yet Constance Classen has claimed that 
the standard ranking of the senses in Western culture, in which sight occupies 
the highest position, preceded print culture by many centuries (Classen 1997). 
Furthermore, other scholars have stressed the continuing significance of sound for 
people’s everyday spatial and symbolic orientation in modern times (Corbin 1999, 
Bull and Back 2003). As Jonathan Sterne has clarified, the search for an overall shift 
in the sensory selectivity of the West unjustly assumed that the history of the senses 
should be “a zero-sum game, where the dominance of one sense by necessity leads 
to the decline of another sense” (Sterne 2003, 16). Consequently, many academics 
today argüe instead for a “perceptual equilibrium” that has been present since at 
least the later medieval period (Woolf 2004). 

For a long time, however, the idea that modern Science is a visual affair remained 
undisputed. The big picture was that profound changes in natural philosophy in 
the sixteenth and seventeenth centuries — including the new reliance on observa- 
tion and experiment as legitímate sources of knowledge, as well as the growing 
importance of print as a vehicle of academic communication — had given rise to a 
situation in which scientific knowledge increasingly carne to be expressed in visual 
terms. “Carne to be expressed” happened to be an important qualification, though. 
The empirical turn in STS moved scholars away from formulating universal criteria 
that demarcated rational Science from irrational beliefs and shifted their focus from 
overidealized conceptions of Science to the everyday practices of Science. 

This change did not result in debunking the significance of visual routines in 
Science per se. On the contrary, the new approach fine-tuned the understanding of 
the historical processes and sociological mechanisms that had made visualizations 
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the most widely accepted forms of scientific representation. Yet the empirical turn 
also created a heightened awareness of the contingencies in the skills used to acquire 
data , both the visual skills and those related to the other senses. As to the issue of 
representation, sociologist of Science Bruno Latour showed how inscriptions such 
as tables and diagrams, because they were both immutable and mobile, effectively 
circulated locally acquired data across geographically widespread networks of 
knowledge (Latour 1986, 1987). Moreover, publications at the intersection of the 
history of Science, media studies, and art history added insights into the shifting 
forms of visual display and the educational entertainment that preceded the spread 
of visual literacy (Stafford 1994; Tufte 1997). 

At the very same time, scholars in STS began studying what actually happened 
in the laboratory before scientists sent out their inscriptions. These highlighted the 
subtle interconnections between the experimenter’s body, the use of scientific 
instruments, and the outcomes of laboratory work. Ethnographic lab studies 
revealed that experimental skills involve so much tacit knowledge that the exact 
replication of experiments is, in fact, extremely difficult to attain and could lead to 
an infinite “experimenters’ regress” (Collins 1985). Collins based the notion of tacit 
knowledge upon the work of Michael Polanyi (1967), considering it “an embodied 
kind of know-how irreducible to symbolic terms” (Mody 2005, 176). In addition, 
historians of Science produced studies of the intricate relationships between research 
instruments and the content of the knowledge generated (Hankins and Silverman 
1995; Schaffer 1999). A more nuanced account of visual instruments, for instance, 
led to the idea of an “externalized retina” that structures the cognitions of scientists 
(Lynch 1990). Similar in approach, studies of nonverbal “visual thinking” and 
materially located “situated actions” in engineering acquired salience (Ferguson 
1992; Suchman 1987). 

Closely in line with this type of research and echoing the rise of the history and 
anthropology of the senses, the study of the role of senses other than seeing in 
Science, engineering, and medicine gained momentum. We have already mentioned 
Lachmund’s work (1994, 1999) on the rise of the stethoscope and auditory skills in 
nineteenth-century medicine and how this related to knowledge of pulmonary 
diseases (see also Duffin 1998). Others showed how technicians and engineers 
listened to photocopiers (Orr 1996) and cars (Borg 2007) to detect flaws in the 
mechanisms, as well as how in materials Science the sounds of laboratory instru- 
ments and data informed researchers of the quality and content of the experiments 
(Mody 2005). 

Such work does not imply that all scientific work draws on the senses. Many 
Sciences depend on the use of mathematics and logics that rely less directly on sen- 
sory skills. Moreover, many historians of Science accept the claims of Lorraine 
Daston and Peter Galison (1992, 2007) on the rise of “mechanical objectivity.” As 
they have shown, the moral of mechanical objectivity in Science gradually gained 
significance, reaching its zenith in the 1920S. This moral dismissed the human mind 
and body as trustworthy witnesses of natural phenomena in favor of the registra- 
tion of such phenomena by machines. Yet as Lissa Roberts (1995, 519) has argued, 
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eighteenth-century chemists publicly sidelined touch, hearing, smell, and taste and 
in fact transformed “their bodies into appendages of a machine.” 

To understand the role of the senses in knowledge dynamics, it is thus useful to 
distinguish between the different sites at which scientists, engineers, and physicians 
perform their work: in behind-the-doors laboratories and out-of-sight fields, in the 
faetones and hospitals, where professional engineers and doctors examine indi- 
vidual machines and bodies, as well as at the conferences at which they present their 
results in scientific papers. These different sites are all represented in the chapters 
that follow. 

Across these sites of knowledge production, different modes of listening 
are involved, such as monitory, diagnostic, exploratory, and synthetic listening. 
Monitory listening is the kind of auditory surveillance that scientists, engineers, and 
physicians employ in order to check the proper functioning of instruments, 
machines, and patients’ bodies (Mody 2005; Bijsterveld 2006). Diagnostic listening 
refers to the mode of listening that physicians apply to identify pathologies when 
using a stethoscope (Lachmund 1994, 1999; Rice and Coltart 2006) and that engi- 
neers use to detect the origin of calculation mistakes in computers by amplifying 
their sound (Alberts 2000, 2003). Whereas monitory listening is used to determine 
whether something is wrong, diagnostic listening reveáis what is wrong. Exploratory 
listening is listening to discover new phenomena. Susan Douglas (1999) used it to 
refer to the way in which early amateur radio operators searched the ether for radio 
stations. The use of sound recording technologies by ornithologists to identify new 
bird species is a similar form of exploratory listening. Synthetic listening , finally, 
focuses on the understanding of polyphonic patterns of sound, such as in the 
sonification of scientific data (Dayé and de Campo 2006). 

All of these modes of listening require specific skills. These are not only skills 
for understanding what exactly one is listening to but also technical skills linked 
to the listening devices used, such as positioning a stethoscope properly on a 
patient’s body or effectively handling a magnetic tape recorder. Even musical skills 
may be required as, for instance, when engineers single out the different “melodies” 
and “rhythms” of humming computers. This explains why we prefer to speak of 
sonic skills rather than auditory skills. 

The chapters that follow have produced highly contextualized insights into the 
fundamental issue of sensory selectivity in the production and validation of scien- 
tific knowledge and technological design. Rather than proclaiming a victory of the 
visual in Science, pleading for the emancipation of hearing at the expense of seeing, 
or defending a perceptual equilibrium, the chapters investígate when, how, and 
under what conditions the ear has contributed to knowledge dynamics in tándem 
with or instead of the eye. All of the authors have their own particular take on these 
issues, colored by the scholarly traditions they draw from. We do not want to give 
away all of their conclusions. We do, however, want to flag a few remarkable issues 
that return in many of the chapters, albeit under different headings. 

The first issue concerns the contexts in which an interest in sound leads to the 
acquisition of knowledge of objeets, machines, bodies, and environments — what 
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Steven Feld has called “acoustemology” (Feld 2003) and Tom Porcello has retuned 
into “technostemology” for technologically mediated forms of acoustemology 
(Greene and Porcello 2005). Apart from the availability of instruments that have 
enabled or assisted listening, there seems to be a clear relationship with military 
activities — detecting “the enemy” with the help of sound — or, in contrast, the wish 
to leave such a military focus behind, such as in reconversión activities (Mody, this 
volume). Another relevant context involves efforts to aid those who are blind 
or deaf (Supper and Mills, this volume). Moreover, the epistemological status of 
sound and the attitude toward acoustic instruments or practices seem related to the 
dominance of particular sensory-related crafts and instruments within certain dis- 
ciplines or medical practices, such as the microscope in biology or the stethoscope 
in medicine (Mody and Rice, this volume). 

Another intriguing issue is that in their attempts to make sound “speak” to 
them and to inform them about the topics they study, scientists, engineers, physi- 
cians, and test subjects struggle with the verbalization or description of sound. To 
compare sounds — whether those of car engines, birds, hearts, sonar, or sonified 
data — and to delinéate even the tiniest differences, knowledge makers need a sound 
vocabulary and a sound language — in short, a way to talk about sound. In doing so, 
they not only use dictionary words for sound, such as “roaring,” “whistling,” or 
“crackling” sounds, but also draw upon analogies with the sounds of musical instru- 
ments, the human voice, and everyday artifacts. Creating a sound vocabulary is a 
prerequisite for making a “sound Índex,” “sound codification,” or “sound taxon- 
omy”: a way of distinguishing and classifying sounds. Subsequently, the taxonomy 
enables “sound mapping”: the assignment of sounds to the information they 
represent, as Stefan Krebs explains in his chapter. 

He and other authors nicely illustrate the various modes of listening in medical, 
engineering, and scientific contexts, such as monitory listening or sonic surveil- 
lance (Rice, this volume), diagnostic listening (Krebs and Rice, this volume), and 
exploratory listening (Fickers, this volume). The empirical evidence in these chap- 
ters also implies, however, that it is helpful to distinguish conceptually between the 
intent behind the listening, such as in our initial definition of monitory or diagnos- 
tic listening, and the more perceptual or epistemological characteristics of listening, 
such as in sound mapping. 

Also remarkable are the many ways in which the sonic and the visual interfere 
with one another. Some of the chapters here describe instances of conversión or 
“transduction” of sonic information into visual information; we have already 
alluded to these phenomena. Examples are recording sound by way of music nota- 
tion and onomatopoeia or with the help of instruments such as the kymograph or 
the sound spectrograph. More important, these chapters also explain why conver- 
sión took place and what effects resulted. Other chapters illustrate how even the 
listening experience itself has been visualized, such as through the design of the 
radio dial (Fickers, this volume) or how visual conceptualizations of sound, such as 
the “cióse up,” entered the world of sound recording in Science (Bruyninckx, this 
volume). 
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Finally, we want to share with our readers the observation that tinkering, repair- 
ing, and do-it-yourself practices in research and art seem to be sociologically related 
to the tendency to adapt audification, sonification, and careful listening, to — in 
other words — some sort of sensory sophistication and sensory flexibility. There is a 
link in this respect between Krebs’s mechanics, Mody’s acoustic microscope people, 
Jackson’s, Helmreich’s, and Supper’s tinkerer-composers, and Mill’s technophile 
cochlear-implant test subjects, which they share with Marc Perlman’s audiophiles 
(2004) and Kristen Haring’s amateur radio operators (hams) (2008). 


4. New Sources and Means 
of Capturing, Storing, and 
Reproducing Sound 


This book covers many new and oíd sources of sound, some intentional and some 
unintentional. Science, technology, and medicine themselves constitute one of the 
biggest new sources of unintentional sounds. The machines of American industrial- 
ization such as the new weaving machines used in Lowell, Massachusetts (Smith, 
this volume), or the industrial machines found in Nazi Germany, the former DDR, 
and West Germany (Braun, this volume) were not deliberately designed to be 
noisy — although extra noise may on occasion also be symbolic of power. 5 It just so 
happens that enormous engines or industrial machinery housed in enclosed spaces 
such as faetones produce sound as a byproduct of their operation. Even in hospi- 
tals, medical technology is a source of often unintentional sound, whether it is 
the buzz of the ventilators of life-support systems or the beeps of more routine 
technologies such as digital thermometers (Schwartz, this volume). This is in con- 
trast to, say, the intentional sounds of music, which, via radio and newly developed 
loudspeakers, was piped into factories in the twentieth century as part of the 
“music while you work” movement also described by Braun (this volume). 

There is of course nothing “natural” about machines having to produce noise 
(Bijsterveld 2008). Once we understand the adverse health consequences of these 
noisy machines, we can design them to be quieter. The lack of naturalness of 
machine noise is nicely demonstrated by the sound engineering of automobiles 
(Cleophas and Bijsterveld, this volume), where a car company such as Porsche will 
employ acoustic engineers precisely to engineer the “throaty” roar of the Porsche 
engine to make it more appealing to its customer base — a sound that interestingly 
appeals differently depending upon the gender and nationality of consumers. 

Unintentional sounds produced in new sorts of locations, such as factories, 
hospitals, and laboratories, can lead to the reconfiguration of space. One of the 
scenes Braun evokes is the workers testing some of the loudest diesel engines found 
in the Rostock diesel factory in the DDR in a special soundproofed room that they 
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have constructed because of the health-threatening aspects of the sounds. Sound 
and acoustic architecture can together subtly change space in new designs of build- 
ings and concert halls (Thompson 2002). Urban space can also be reconfigured by 
mobile listeners with the use of personal stereos and iPods (Bull, this volume). 
Geographical space too, can be dramatically changed as with the case of the early 
radio dial, which was for many listeners their gateway to different cities, regions, 
and countries (Fickers, this volume). 

Science, technology, and medicine are themselves important sources of inno- 
vation for new sonic technologies and instruments. Sounds that have never been 
heard before can be listened to through new devices and pieces of equipment such 
as the stethoscope, which we have already mentioned. Interestingly, in the early 
history of the automobile a form of stethoscope was also developed specially for 
listening to car engines (Krebs, this volume). New sound reproduction equipment 
such as hydrophonic headphones and special loudspeakers enabled people to listen 
to sounds under water, thus facilitating a whole new form of art (Helmreich, this 
volume). New medical devices such as cochlear implants have helped deaf 
people or those with a severe hearing impairment almost miraculously to hear again 
(Mills, this volume). 

New ways of producing and capturing sounds also change the way we mediate 
sound. For instance, in the early days of radio, listeners 5 experience of geography 
was mediated by the tactile feel and visual organization of the radio dial (Fickers, 
this volume). The new parabolic microphone allowed bird sound to be captured 
but required a truckload of equipment to accompany it, thus mediating the sorts of 
“natural 55 environments where bird sounds could be recorded (Bruynincks, this 
volume). With new instruments and techniques, new sorts of skills and tacit knowl- 
edge can also emerge (Rice, Krebs, and Kursell, this volume) and may involve new 
regimes of training, as well as the new types of listening skills, which we discussed 
earlier. 

Science, technology, and medicine offer new ways of transforming or “trans- 
ducing 55 sound. The transformation of sound to another material médium enables 
it to be more easily stored and transported, such as with phonograph cylinders, 
player piano rolls, vinyl records, tapes, cassettes, compact dises, and later a vast 
array of digital storage and transmission media. This is an important point for 
sound studies to digest and lies at the core of the STS approach to sound. 

The strength of Science studies is in its dealing with materiality, the senses, cul- 
ture, and politics within the same analytical register. Special pieces of equipment 
that render the external world into visual traces that are in turn reproducible and 
transportable are known within STS as “inscription devices 55 (Latour and Woolgar 
1979). In sound studies we are dealing with an even wider range of transformations 
or conversions between different senses and different media (for instance, the under- 
water sounds described by Flelmreich). Scholars in this volume employ terms such 
as transduction (Helmreich) or conversión (Mody) to describe such processes. 
Indeed, from the perspective of sound studies it is possible to see the “inscription 
device 55 as a rather limited, purely visual form of a much more general class of 
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devices for transforming the external world into reproducible and transportable 
sensory traces. 

The increasing movement toward sonification has produced a range of new 
ways of rendering scientific data into the sensory realm (Supper, Sterne and 
Akiyama, this volume). The transformation to another media or material also pro- 
duces the possibility of new forms of sound storage such as the phonoautograph 
(Sterne and Akiyama, this volume), the phonograph (Kursell and Katz, this volume), 
the player piano (the punched paper holes can even be thought of as an early means 
of digital storage) (Katz, this volume), and the current digital revolution aligned to 
computers, software, and the Internet in the ways that sound and music are pro- 
duced, stored, and consumed (Grimshaw, Whittington, Katz, Fouché, Pinch and 
Athanasiades, and Bull, this volume). Sometimes new inscription devices, such as 
the kymograph, 6 invented by Germán physiologist Karl Ludwig in the 1840S, can be 
combined with new sound technologies such as the phonograph to provide new 
tools for scientific investigation (Kursell, this volume). Another example is the 
combination of the new parabolic microphone for recording bird sound with the 
audio spectrogram to advance scientific studies of birdsong (Bruyninckx, this 
volume) 

With these new ways of transforming, storing, and reproducing sound come 
interesting conceptual issues about how the “transformed” sounds are experienced. 
For instance, with the development of the phonograph, new sorts of musical idioms 
became popular because of phonograph recording techniques, such as overcom- 
ing the sonic limitations of the bullhorn microphone, by singers “crooning” or 
excessive vibrato on a violin (Katz 2004). Indeed, such musical effects, rather than 
being seen as artifacts of the recording process, started to sound “real,” and per- 
formers began including them in their live performances — a process known as 
phonorealism. With the digitization of sound, new sorts of sonic experiences have 
become possible, such as the “hyperrealism” of sound and the “immersion” of the 
listener in the sonic experience of modern movies and video games (Whittington 
and Grimshaw, this volume). Novel technologies, such as digital samplers and 
synthesizers, enable sounds to be specifically “designed” to enhance visual stimuli 
in movies (Whittington, this volume) or to sell commercial producís (Taylor, 
this volume). Technology, it can be argued, is creating original sorts of pheno- 
menal experiences, such as with cochlear implants — a foretaste of what some futur- 
ists claim will be life increasingly emulating video games (Mills, this volume). 
Psychological States are more and more linked directly to sounds, whether these 
States are the audiotopia sought after by iPod listeners (Bull, this volume) or involve 
the monitoring of EEGs linked to sound sources, which has been proposed as a new 
way of making video games even more sonically immersive (Grimshaw, this 
volume). However, the significance of these transformations between the senses is 
even deeper for the field of sound studies. As Sterne and Akiyama argüe, as the 
senses themselves become ever more technologically mediated, they become more 
“plástic,” and this challenges standard accounts of individual senses that one 
typically encounters in histories, anthropologies, and cultural studies. 
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The new technologies of sound have had a dramatic impact on the arts. The 
invention of the electronic music synthesizer has been described as one of the major 
musical transformations of the twentieth century (Pinch and Troceo 2002). The 
boundaries between what is natural and what is synthetic in music has always been 
contested. For example, are the keys on a ilute a “mechanical” interference or an 
integral part of a “natural” instrument (Pinch and Bijsterveld 2003)? What makes a 
musical instrument an instrument as opposed to a machine or piece of technology 
(Pinch and Troceo 2002)? What counts as music and what counts as noise have, of 
course, been fiercely contested (Kahn 1999). Unintentional sounds such as the 
sound of wind made by an Aeolian harp or the sounds heard in John Cage’s famous 
4 , 33 // challenge our notion of music. New sounds, such as that made by a scratch on 
a vinyl record, can be a source of innovation for a whole new genre of music such 
as in hip-hop (Katz and Fouché, this volume). New instruments developed in the 
lab, such as the metronome, siren, and tuning fork, can become sources of musical 
inspiration that lead to new compositions (Jackson, this volume), as can the instru- 
ments taken over from war-time sonar used to make underwater music (Fíelmreich, 
this volume). Digital software enables users to transform their personal computers 
into recording studios and leads to new genres of music such as “remixing” and 
“mash-ups” (Katz, Pinch, and Athanasiades, this volume). Sometimes the new 
sonic technologies and the sorts of sound they produce lead to desires to acquire 
and experience earlier ways of producing and listening to music, such as the 
“technostalgia” involved in the turn to vintage electronic instruments (Pinch and 
Reineker 2009) and the desire for authentic vinyl scratching practices in hip-hop 
(Fouché and Katz, this volume). 

The border crossings between the arts and Science documented in this book 
reveal again that essentialist definitions of instruments and their proper domains are 
unnecessarily restrictive. “Follow the instruments” is the methodological heuristic 
heard in this volume (Pinch and Bijsterveld 2004). Instruments developed in Science 
and engineering can become part of the arts and the commercial world and vice versa 
(Kursell and Jackson, this volume). The lack of one meaning of scientific results and 
technologies is referred to as “interpretative flexibility” within STS (Collins 1985; 
Shapin and Schaffer 1985; Latour 1987; Bijker, Hughes, and Pinch 1987). The role of 
users is crucial here as well. Users, such as Grand Wizard Theodore, who is reputed 
to have invented the hip-hop vinyl scratch, have in effect redesigned the technology 
of the record player for an entirely new use in music making (Eglash et al. 2004). 
Within STS we describe this as the “interpretive flexibility” of technology once more 
appearing but in this case in the context of use (Kline and Pinch 1996; Oudshoorn 
and Pinch 2003). This book is fxill of such instances, including the remarkable exam- 
ple of one the earliest methods of magnitizdat found in Eastern Europe during the 
Soviet era, documented by Trever Hagen and Tia DeNora (this volume). The emul- 
sión on discarded X-rays from the 1950S in the Soviet Union provided a material that 
could be engraved as a record. The production and distribution of these homemade 
records became known as roentgenizdat , or “playing the bones.” This case and the 
hip-hop one add a political valence to these user practices. 
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Running like a bass ostinato throughout the transformations, travels, and ter- 
minations undergone by sound in this volume are indeed the politics of sound. 
A common struggle is which professional group gets authorized to use which 
sound technologies and where (Krebs, Katz, Supper, and Rice, this volume). The 
politics of sound crops up in the status of the amateur in music (Katz, Pinch, and 
Athanasiades, this volume), the struggles over racialized and gender identities 
(Smith, Braun, Schwartz, Fouché, Pinch, and Athansiades, this volume), and the 
sorts of subversive listening and music practices encountered during the Coid War 
in Eastern Europe (Hagen and DeNora, this volume). Earlier struggles over noise 
abatement (Thompson 2002; Bijsterveld 2008; Braun, this volume) and new strug- 
gles over the possible damage to hearing from new devices such as earbuds (Bull 
and Schwartz, this volume) remind us that sound, as in the story of the SoundEar, 
is part of the lived politics of everyday life. 


5. Chapter Summaries 


One of the most enduring sources of unintended sound is natural phenomena, 
such as thunderstorms, waterfalls, the ocean, volcanoes, and the sounds made by 
the creatures that inhabit the natural world. This concern is central to Mark SmitlTs 
opening chapter of the Shop Floor Section, where he reconsiders the “machine in 
the garden” thesis of industrialization in the United States, made famous by Leo 
Marx. FIow was the sound of nature experienced, and how did it contrast with the 
new sorts of industrial sounds found in factories? Furthermore, how did the sounds 
of the new industrial soundscape contrast with the sound of another means of pro- 
duction — the more silent and agrarian use of slavery in the South? 

What machine sounds mean to listeners and how they respond to them cru- 
cially depend upon historical context, as Smith shows in his chapter. Hans- Joachim 
Braun writes about a very different period, Germany in the Nazi era and the “two 
Germanies” after the Second World War. One of the most intriguing questions he 
discusses is how the differing Germán political regimes and their dissimilar com- 
mitments to the rights and safety of workers have actually dealt with industrial 
noise. 

The sound of the early automobile engine in Germany from the 1930S to the 
1950S is the topic of Stefan Krebs’s chapter. Here a community of listeners arises 
who specialize in diagnosing a car’s problems from the sound of its engine, gearbox, 
and so on. This community turns out to be transient. First, it consists of the 
car owners themselves, who listen to the car — a form of listening that Krebs calis 
“listening by driving,” but later (by the 1950S) it is the specialized garages and auto 
mechanics who do most of the listening — a practice that over time largely vanishes 
in favor of other techniques of diagnosing automobile problems. Again, issues 
of “diagnostic listening” are important, but Krebs also offers a new theoretical 
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addition with Michel Foucault’s notion of a “dispositive” to link these listening 
practices and their demise in the wake of wider power and discourse issues in 
society at large. 

Cleophas and Bijsterveld write mainly about car sound and how European 
manufacturers in the 1990S tested what consumers wanted in that regard. They 
focus largely on the interior of cars and how even things such as the “crackle” of 
leather upholstery should come with the “right sound” for a particular make or 
model. The tests carried out reveal that laypeople’s vocabulary of sound and their 
listening skills do not always match those of expert engineers. They extend STS 
approaches toward understanding testing to this new sonic environment. 
Negotiating a common shared meaning — the outcome of a test — is even harder in 
the context of different European cultures. Moreover, Cleophas and Bijsterveld 
draw upon Gerhard Schulze’s notion of Erlebnisgesellschaft , or “experience-driven 
society,” to explain why the car industry began caring so much about sound 
design at all. Schulze argües that since many producís have been perfected, sensory 
experience plays an ever-greater role in their marketing and sales. 

The sound of nature — in this case, birds — is a central feature of Joeri 
Bruyninckx’s chapter on how the recording of birdsong has evolved and become 
more scientific since the late 1920S. He focuses upon ornithologists in the United 
Kingdom and the United States and shows how special microphones and devices 
for turning sound into visual traces, such as the vibralyzer, the oscillograph, and the 
audio spectrograph (taken over from wartime uses of sonar to search for subma- 
rines), have played a crucial role. Often trips into the wild to record the sounds of 
birds were stymied by the large mobile studios needed to house the recording equip- 
ment of the day. Part of his story is the role played by people in the movie industry, 
who were often the first to develop new sound technologies for places such as the 
well-known Cornell Lab of Ornithology. Interestingly, later on in another very dif- 
ferent sonic context we find scientists returning the compliment by contributing to 
the movie industry. Bill Whittington writes in his chapter about how Cornell Com- 
puter scientists in collaboration with companies such as Pixar are developing new 
ways to digitally realize sounds for movies. Although for the purpose of organiza- 
tion we have divided this book into sections such as the Laboratory, Field, and 
Studio, the actors studied do not always respect or follow such categories. As 
Bruyninckx points out, although we have placed his chapter in the field section, the 
field, lab, and studio become “blurred spaces” in his study. Perhaps as this is a 
sounds study book we could better say that the different sites (as in the recording 
studio) suffer “leakage.” 

The sound of nature does not feature much in other chapters although the 
médium of water is an important theme in Stefan Helmreich’s chapter on under- 
water music. Helmreich shows how the explorers of underwater music learned 
much from military underwater technologies such as sonar and the development of 
hydrophonic headphones. New technologies of loudspeakers are also required to 
produce underwater sounds. There is fluid movement between the Sciences and the 
arts. Helmreich is one of the first authors in the handbook to raise the issue of 
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transduction — sound that is usually experienced in air must be experienced in a 
totally different médium, water, and this requires special devices: “transducers.” 
What does it mean to “transduce” or transform sounds from one sonic médium or 
sense to another, and what is lost and gained in the process? 

Julia Kursell shows in her chapter how the phonograph and the kymograph 
were used by Cari Stumpf at the Berlín Institute of Psychology in the early twentieth 
century as a tool for the scientific investigation of sound and hearing. Stumpf 
explored the ways in which these new instruments could be used for the study of 
language. For instance, he recorded and replayed sounds, such as human vowels, at 
different speeds. In 1908 Stumpf also established a phonograph archive that con- 
tained examples of musical use from all over the world as part of an ethnologically 
inspired endeavor. At the same time, the archive provided a basis for the new field 
Science of comparative musicology — which today we would cali ethnomusicology. 
Kursell uses Germán media theorist Friedrich Kittler to rethink what is involved in 
the standardization of a media technology. 

Myles Jackson also reminds us that sound travels between the arts and 
Science in his investigation of several devices developed in the nineteenth century 
by scientists and acousticians interested in better understanding musical phenom- 
ena such as pitch. He documents the invention of the tuning fork, the siren, and the 
portable chronometer (the latter became the metronome) and the different uses 
they acquired later in musical composition and performances in the twentieth cen- 
tury (the tuning fork and metronome, he notes, also became important instru- 
ments in nineteenth-century physiology). This traveling across disciplines and 
indeed between the Sciences and the arts in general makes following these acoustic 
and sound technologies even more salient. As these instruments get used in new 
ways in new contexts by new groups of users, we start to hear a richer story of how 
human creativity and invention occur. While bricolage and tinkering are going on, 
their sounds are calling out to be listened to. What is at one moment a scientific 
instrument can later become a new musical instrument. Similarly, a musical instru- 
ment such as the phonograph can become part of a scientific laboratory, as Julia 
Kursell details. The world of sound has for too long been walled off — it is an impor- 
tant part of the means and methods on which the world of human ingenuity and 
creativity thrive. 

At the start of his chapter, Cyrus Mody again draws attention to the issue of the 
transformation of sound by the community of probe microscopists, who developed 
many of the new sorts of instruments that are crucial to nanotechnology. Mody 
uses the term synesthetic conversión , meaning conversión and reconversión between 
any of the senses, and points out that probe microscopists were attempting to 
engage the haptic dimensions of the practice, as well as the audio and the visual. 
Mody’s chapter is largely about the wider context in which this community of probe 
microscopists emerges at specific locations, such as Stanford University during the 
1960S. He shows how this group was also embroiled in the “reconversión” of 
American academic research from military funding and applications to civilian 
funding and an orientation to “human problems.” Mody thus shows how the 
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specific technical concerns and practices of the scientists he studies were linked to 
the wider social and political context in which they worked. 

The transformation of scientific data into auditory information is part of a 
wider process known as sonification. In her chapter, which concludes the section on 
the laboratory, Alexandra Supper looks at the recent history of this emergent field. 
Researchers in the field must not only find a way to reduce the residual bias toward 
the visual but also convincingly legitimize the various sonification techniques, often 
linked to music and the arts,to make this a discrete field in its own right. No one 
describes “visualization” as a sepárate field of Science, so what warrants a field of 
“sonification”? The search for the “killer application” that Supper documents 
is further testimony to its current striving for professional status. Supper follows 
sonification through different contexts in great detail by studying its discursive for- 
mations and strategic demeanor and its attempts to professionalize. Part of the 
novelty of her work lies in her focus upon conference presentations by scientists — 
an area neglected in much of the history and sociology of Science. 

Hospital sounds are discussed in the chapter by Hillel Schwartz. Drawing upon 
an astounding range of sources, including literary greats such as Homer and Kafka, 
he details the history of the silent hospital, starting with Florence Nightingale’s 
experiences in the Crimean war. Schwartz considers hospital architecture and the 
ever-increasing encroachment of noise in parallel with the history of another audio 
technology — the earplug. He traces its use particularly through the First World 
War and up to the noise-canceling headphones of today. It is the conjunction of the 
technology of personal sound (and its prevention) and the institutional sound of 
the hospital that gives Schwartz’s chapter its tensión. Just as in the social Sciences, 
where we wrestle with a concern for both individuáis qua individuáis and wider 
social formations such as groups and institutions, so, too, with sound technologies: 
Is sound an institutional problem to be treated by reforming hospitals, or is it up to 
individuáis to wear sound devices to protect themselves? The wearing of individual 
devices, of course, soon gets coupled with issues of stigmata and gender (the “man- 
liness” or not of such devices) as Schwartz details and which Braun and Smith in the 
very different context of industrial noise also note. 

Another important source of sound is the human body. The process known as 
auscultation , which is the topic of Tom Rice’s chapter, concerns the sounds of inter- 
nal organs such as the heart and how they can be listened to with a new device, the 
stethoscope. Interestingly, Rice was drawn to this topic while carrying out a project 
similar to Schwartz’s, namely a study of how patients in a modern hospital experi- 
enced sound. Rice’s study is one of the most anthropological in the volume as he 
details his own “ears-on” experiences as he learned auscultation from a practitioner 
in a leading London teaching hospital. We have already mentioned diagnostic 
listening skills, and this chapter explores them in detail. It discusses issues such as 
how the sound of a heartbeat can help medical practitioners make a diagnosis, how 
stable the categories it produces are, and how reliable the technique itself is, espe- 
cially when other medical contexts (such as that in the United States) play down 
auscultation in favor of heart monitoring by echocardiography (which uses another 
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sonic technology, ultrasound, to produce a sonograph of the heart). Rice not only 
is concerned with how the listening skills are acquired and passed on but also notices 
how listening to the body requires medical practitioners to acquire special bodily 
skills and postures as they learn to position their bodies in the best way to employ 
this specialized form of listening. He thus engages with the discussion on tacit skills 
in this volume. 

Another chapter in the section on clinics discusses a new device that utilizes 
sound technology, the cochlear implant, which enables deaf people or those with a 
severe hearing impairment to hear again. This controversial topic — some members 
of the Deaf community welcome cochlear implants, whereas others reject them as a 
violation of “Deaf culture” — has been treated within STS before where there is a 
growing literature on disability studies (Blume 2010; Haraway 2008). Mara Mills 
describes the cochlear implant as the most common neural-computer interface in 
the world with more than two hundred thousand users. She sets her chapter within 
the context of neuroenhancement and its futurist discourse and offers a new history 
of cochlear-implant development by focusing upon the previously neglected topic 
of the subjects actually used in the research. By treating the subjects as “users,” she 
shows how their concerns sometimes get incorporated and sometimes ignored in 
the design of these devices, which must also accommodate economic concerns and 
the pressures of the wider medical establishment. One thus gains a sense of the new 
forms of subjectivity that these devices can enable. Mills intriguingly quotes media 
theorist Vilém Flusser, who found that his own deafness and a hearing aid gave him 
more control as to what he chose to hear — a form of “ear lid” that enabled him to 
better navigate his immersion in the world’s noises and voices and permitted him 
to “see through” the programming of his own hearing aid and thus both “see” and 
hear better. 

Immersion is an important theme in the handbook and no more so than in the 
section on the design studio. Often work on the phenomenology of sound, such as 
Don Idhe’s important early investigations (Idhe 1976, 2007), starts by pointing out 
the difference between listening and seeing. The visual field appears as a form of 
screen surrounded by darkness in front of our eyes, but we experience sound all 
around us — in short without the aid of “ear lids” we are immersed in sound. The 
issue of sound immersion particularly comes to the fore in the design of video 
games, which is the theme of Mark Grimshaw’s chapter. Grimshaw, who is himself 
a game player and designer, shows how, in video games, the players have enormous 
control over where in the game the character they opérate is at any moment; in 
addition, the sounds must respond appropriately (for instance, movement must 
generate the sound of footsteps). Furthermore, these sounds may convey important 
information to the player, such as footsteps coming from behind, signaling a threat. 
Because of the vast range of sounds that need to be generated in real time, the sonic 
experience in video games is also much more contingent upon the sound-generat- 
ing technology available. Grimshaw’s chapter shows how technological constraint, 
the users’ own imaginative experiences, the designers’ goals, and the visual material 
offered to the user all work together to produce a constructed immersive experience. 
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Grimshaw ends his chapter by pointing to an as yet unrealized possibility in games 
whereby sound itself might respond to the affective State of the player as measured 
by psychophysical sensors such as an EEG. In such a scenario one could imagine a 
frightening sound being made even scarier because the player did not yet seem to be 
frightened enough! This more complete enhanced immersive experience bears 
an uncanny similarity to the futurist scenarios Mills discusses in her chapter, 
whereby brain implants and neuroenhancers will, according to some futurists, 
make ordinary life appear more and more like a video game. 

In contrast to Grimshaw’s chapter on video design, where the perspective of the 
player and the designer form the core of the analysis, Bill Whittington, a film 
scholar, considers the emergence of a whole new way of designing sound for film 
and a specialized company (Pixar) associated with it. Pixar, of course, is famous for 
having developed the computer-generated animation methods that replaced the 
extremely labor-intensive way that cartoons used to be made. Whittington shows 
that part of the story of this revolution in film production is a sound story. Film 
sound underwent a digital transformation of its own with the introduction of new 
sound formats and the rise of the sound design movement, which involved new 
sound technologies such as synthesizers and samplers. Indeed, Whittington argües, 
the technological innovations and new aesthetics that Pixar developed, including 
the use of sound techniques from live action movies, gained credibility in part 
because of sound. Whittington focuses mainly on the early animated short films 
that Pixar made. Important is the development of the technique and the aesthetic 
of what he calis “hyperrealism,” where designers created stylized constructions that 
accessed familiar sound events yet recombined these effects to create new sound 
impressions. For instance, the sound of a dog barking in Toy Story was made by 
combining the sound of a barking dog with that of a tiger. Whittington unpacks the 
cinematic codes embedded in the new methods and traces the relationships among 
the complicated networks of production companies, sound houses, and designers 
in the San Francisco Bay area, which preceded Pixar and led to its eventual success. 
With the current success of 3D productions such as Avatar , Hollywood is still 
undergoing dramatic changes, and, as Whittington notes, the increasing turn to a 
“spectacle of sensation” means that the immersion experience of film is drawing 
ever closer to that of the video game technologies discussed by Mark Grimshaw in 
his chapter. 

In the early 1960S, commercial electronic music was first becoming realizable. 
Tim Taylor, an ethnomusicologist, in the final chapter in the section on the design 
studio documents the early use of electronic sounds, such as those of the newly 
invented Moog synthesizer, to sell producís such as beer and coffee. Taylor focuses 
on two of the best known U.S. commercial musicians of the period, Raymond Scott 
and Eric Siday, and dissects in great musicological detail their most famous ads. He 
contrasts Scott’s and Siday’s early efforts with those of a 1980S’ composer, Suzanne 
Ciani, who was famous for a Coke ad realized on her Buchla synthesizer, which 
evokes the kind of hyperrealism Whittington discusses. In the ad, the sound 
of Coke being poured is much more evocative than the sound of a the real thing.” 
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One can speculate that just as for the 1950S, when movies and ads led to the growing 
acceptance of electronic sounds, hyperrealism as a form of listening — perhaps a 
new sonic skill — was conveyed via the popular médium of ads. Taylor’s chapter 
is interesting when read beside the earlier chapter by Cleophas and Bijsterveld on 
selling car sounds. In their case the car sounds help sell the product, and in Taylor’s 
case the hyperreal sound of the product sells the product. 

In the opening chapter on the section on the home, Andreas Fickers tells us 
about early radio and in particular the role played by the iconic radio dial. For many 
people who grew up in the 1950S (as one of the editors of this book did), the dial 
glowing on the radio on the mantelpiece provided the first window into a global 
world — a world where listeners in Europe learned for the first time of the impor- 
tance of, for instance, Hilversum, Holland (always featured prominently on such 
diais). Fickers details technical developments in early radio whereby it became pos- 
sible to easily tune in stations — a precondition for the radio dial. Flowever, the 
weight of his chapter is on spectrum-allocation issues that the International 
Broadcasting Union mediated in the 1920 and 1930S as different national radio 
interests negotiated nothing less than the layout of the dial. Fickers thus bridges the 
detailed listening habits in the home and the wider world of regulatory authorities 
to provide an intriguing account of the material, institutional, and symbolic aspects 
of this crucial feature of a single audio technology. 

Radio and the elicit tuning-in of stations feature in Trever Hagen and Tia 
DeNora’s chapter on the role of sound technologies in Hungary and Czechoslovakia 
during the 1960S and 1970S as growing opposition to the official Communist regimes 
and their state-sanctioned music practices developed. We learn fascinating details 
about this world, such as that the sound of the Soviets’ attempts to jam Western 
stations was referred to by listeners in Czechoslovakia as “Stalin’s bagpipes.” They 
use anthropologist Víctor Turner’s notion of a liminal space (defined by Turner as 
“betwixt and between”) to describe the sorts of nonofficial practice in which 
their subjects engage. The listening and performances that Hagen and DeNora 
document oíten take place in prívate spaces away from the prying eyes of the 
State. We learn how technologies such as radio, records, and cassette tapes 
were all important in the development of new forms of subversive practice and 
also how the sound technologies were in turn shaped by these practices. In Hagen 
and DeNora’s chapter the twist is that these subversive listening and musical prac- 
tices help coproduce a sort of unofficial antistate as users learn how to develop a 
“disposition” to challenge the official State. 

Mark Katz shows the continued importance of what might be described as 
“amateurs.” He starts by examining the use of the phonograph and the player piano 
and puts paid to the myth that the invention of these devices killed off amateur 
music making. Indeed, the story he tells is the converse: These new devices actually 
facilitated and encouraged amateur music making, and manufacturers often 
responded to these new users by modifying the devices to facilítate more new uses. 
Katz shows that technologies such as phonographs and the player piano employed 
devices to adjust tone, tempo, and loudness, along with the performance, so that 
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the listener was far from being passive. He notes that, years before the digital 
médium aróse, there was a lively tradition of home recording with the phonograph, 
with dubbing and the making of what we would today cali “mash-ups.” Other ama- 
teur uses discussed by Katz are the development of karaoke in Japan (one of the few 
sections of this handbook to deal with music in Asia), the ways users adapt the 
video game “Guitar Hero,” and, as mentioned already, the birth of hip-hop. 

In the concluding chapter to the section on the home, Trevor Pinch and 
Katherine Athanasiades examine a specific form of digital musical community that 
has arisen with the advent of the Internet. These amateur musicians, who form a 
worldwide community, post their musical compositions to a special website, where 
their music can in turn be downloaded and reviewed by other users and which 
offers a chart position for every piece of music posted. Pinch and Athansiades show 
how online reputations at this site are established with the help of yet another form 
of transduction, one between sounds and music and the words and symbols that 
popúlate online ranking systems. They also show that, although the website does 
open up new possibilities for musical identities and forms of collaboration, many of 
the identities (including gendered identities) and processes found there actually 
bear more resonance to existing offline musical practices and identities. 

The last section of the handbook contains three chapters that deal with the new 
possibilities of moving sound that digital storage offers. However, Rayvon Fouché’s 
chapter, which opens this section, starts off in the analog world. The origin story of 
the vinyl record “scratch” is well known. At first an unwanted and annoying aspect 
of vinyl record listening, in the hands of a skillful DJ, “scratching” became part of 
the new music making of hip-hop. Special tools and technologies and eventually 
digital scratching systems evolved as Fouché discusses. He points to the crucial role 
played by hybrid digital-vinyl systems, which allows DJs to control digital music 
with specially encoded vinyl records and turntables, thus preserving some of the 
authentic craft of analog scratching while seeming to embrace the latest digital 
advances. Rather than having to schlep heavy vinyl records around, DJs can now 
have all their music conveniently nearby, stored on a digital Computer. The prac- 
tices of digital scratching and the use of vinyl have, however, remained in tensión 
within the hip-hop community. Fouché’s chapter is one of the few in the volume 
where issues of race and ethnicity and audio technology are interconnected. He 
documents how Grandmaster Flash played an important and often-neglected role 
in outlining the technical specifications for one of the mixers used by DJs. 

The sound of the urban soundscape forms the background to Michael BulFs 
chapter on iPod listeners. The iPod (along with other MP3 players and cell phones) 
is one of the most pervasive new sound technologies ever introduced. As Bull (2000, 
2007) has shown, personal stereos have profound consequences not only for how 
users’ psychological States can be mediated, modulated, and controlled but also for 
how they experience their external environment, including space and social inter- 
action. In this chapter Bull argües that users must navigate between an extreme 
State of enjoyment, which he calis audiotopia, and the opposite State, audio toxicity — 
where iPod users lose all sense of their grounding (or, as Bull calis it, “tethering”) 
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in the surrounding social world and even risk physical damage to their hearing. Bull 
draws upon a set of theoretical concerns from cultural studies to show how internal 
experiences and mental States are coproduced or coconstructed with particular new 
audio technologies. 

The final chapter in the volume, which is by Jonathan Sterne and Mitchell 
Akiyama and takes Scott’s phonoautograph as its topic, raises a fascinating set of 
issues having to do with oíd ways of storing sound and modern digital methods of 
retrieving and recovering it. The phonoautograph received worldwide attention in 
2008 after scientists managed to play back some of the songs recorded by Scott in 
1860, thus producing what was claimed to be the “world’s oldest recording.” Sterne 
and Akiyama note that Scott rather belittled Edison’s phonograph as it merely 
played back sound. The phonoautograph was a new way to write sound — in other 
words, to render sound into a visual form. The chapter thus further explores the 
topic of sonification, taken up earlier by Supper. Sterne and Akiyama develop the 
cultural studies idea of “articulation,” which they describe as an approach that is 
more attentive to issues of power than other constructivist approaches such as the 
social construction of technology or actor network theory. Their conclusión is a 
dramatic one for sound studies. The examples of sonification they discuss demón- 
strate that data intended for one sense can be readily transformed into another, and 
they claim this shows the increasing plasticity of the senses. Their goal here is to “lay 
bare the degree to which the senses themselves are articulated into different cul- 
tural, technological, and epistemic formations.” In short, according to Sterne and 
Akiyama, it is no longer possible to treat particular technologies or cultural forms 
as predestined for or determined by a single sense. In a way, the topic of a sensory 
history, of anthropology, or of cultural studies has according to them been 
dissolved. This offers a radical challenge for the future! 


NOTES 


1 The device is made by the SoundShip Company in Copenhagen and has won a 
Danish Design Council award. It has an interesting history: Before becoming adapted for 
classroom purposes, the device was first made with the idea of protecting musicians from 
damaging their ears. 

2 There actually was such an incident experienced by one of the editors (Trevor 
Pinch). The story has been slightly simplified for our purposes here. We are grateful to 
Otto Sibum for drawing our attention to the SoundEar; he and Thomas Kaiserfeld have 
graciously allowed us to use their friendly disagreement in this book. 

3 Http:// www.cost-sid.org/ (accessed Sept. 2 , 2010). 

4 These questions also guide the Maastricht research program “Sonic Skills: Sound 
and Listening in Science, Technology, and Medicine, 1920S-N0W,” coordinated by 
Karin Bijsterveld and funded by a Dutch NWO VICI Award. 

5 For instance, American-made vacuum cleaners are reputed to be designed to be 
louder than similar machines in Europe and Asia because Americans associate a loud 
cleaner with more suction power. See John Seabrook, “How to Make It,” New Yorker , 
Sept. 20, 2010, 66-73. 
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6 The kymograph consisted of a rotating drum covered with paper, over which a 
Stylus moved back and forth recording a physiological variable such as blood pressure. 
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CHAPTER 1 


THE GARDEN IN 
THE MACHINE: 
LISTENING TO 
EARLY AMERICAN 
INDUSTRIALIZARON 


MARK M. SMITH 


Introduction 


Go to http://www.lowell.com/museums.boott-cotton-mills-museum, “Your Guide 
to Historie Lowell, Massachusetts,” and you, as a virtual visitor, will be greeted with 
the following: “One entire floor of the Boott Cotton Mills Museum shows visitors 
exactly how a working mili actually looks. The floor is called the ‘Weave Room,’ 
and it is filled with industrial-grade looms, running at top speed, allowing visitors 
to feel the buzz of a working mili. This is one of the largest industrial history exhib- 
its in the nation, and it is probably one of the loudest — it’s not called the Toar of 
industry’ for nothing!” 1 

Loudest. Roar. Buzz. These are the key words that people generally, historians 
included, deploy to capture and render simple and digestible what was, in fact, a 
very complicated, situated, and highly contextualized process: industrialization in 
the United States. On the one hand, of course, historians of aurality should be 
pleased that their preferred way of excavating the past — listening — occupies such a 
prominent position in historical understanding. As a great deal of work has shown, 
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historians and general historical consciousness is prone, for a variety of reasons, to 
read and understand the past through the eyes. Certainly, histories of industrializa- 
tion are often indebted to visión — the visual rise of the faetones, the slicing of 
erstwhile rural landscapes by impossibly tall chimney stacks, the visual power of 
the smoke, sparks, and sheer scale of the early factories eliciting plenty of visually 
indexed primary evidence. Nevertheless, of all of the principal developments 
usually associated with the coming of modernity in the nineteenth century, 
industrialization was understood, especially by contemporaries but also by some 
historians, as very much an aural affair. 

Yet the way in which historians have chosen to frame the aural history of indus- 
trialization, especially in the context of early nineteenth-century America, is argu- 
ably quite misleading; it is easily understood, interpretatively portable and exportable 
(especially to living museums and their customers), certainly, but is it really accu- 
rate? Does it convey the full texture and, critically, the context in which people, 
especially those on the shop floor, heard and listened? In this chapter I suggest first 
that historians in the United States have tended to ignore the aural history of early 
industrialization (most commentaries, themselves quite rare, tend to latch on to the 
sounds and noises of large-scale, mature industrial America, mainly in the late 
nineteenth century, when industry was becoming a behemoth and, as such, occu- 
pied a more obvious place in the soundscape). I also maintain that they have tended 
to slip too easily — and misleadingly — into a binary understanding of the transition 
to industrialism, which makes artificial distinctions between a “quiet” countryside 
and a “loud” factory shop floor that would not always have been apparent to the 
people who, in fact, experienced the transition. I stress the preeminent importance 
of context in that transition. Factory workers in Lowell, Massachusetts — the first 
industrial laborers in the United States — not only understood their laboring expe- 
rience in terms of aurality but, critically, also mediated that aural experience within 
a larger context. That context involved increasing sectional tensions between a free- 
wage labor North and a slave South, Romanticism, and the imperatives of a factory- 
based paternalism that focused very much on the gender of the Lowell workforce. 

It is something of a commonplace among historians of aurality — including 
me — to argüe that hearing is qualitatively different from seeing because we have 
eyelids and no ear lids. For this reason, so the argument goes, sound has a more 
transgressive quality than sight. While this may be true, the insight tends to raise 
questions. How, for example, do people who find themselves quite suddenly 
immersed into qualitatively and quantitatively new acoustic environments — such 
as with the first generation of female factory workers in the United States — 
incorpórate the new, invariably loud sounds into their worldviews and sound- 
scapes? What were the precise mechanisms that mediated, in other words, the 
transition from rural life to industrial life? Sound, noise, silence — habits of listen- 
ing — were central to that transition, and, in fact, the prevailing cultural New 
England norms of the time concerning the meaning and experience of sound were 
central to effecting what was, for the most part, a smooth transition to industrialism 
for the first generation of factory workers. In other words, the cultural backdrop of 
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religious, natural, and sectional sound helped grease the transition to industrial 
sound so that the sounds of the shop floor were deemed just that: sound, at worst 
noise that was necessary, but rarely simple, intolerable noise. Context, in other 
words, matters a great deal when trying to listen to the aural experience of the first 
industrial workers in the United States (Smith 2007: 41-57). 


The Machine in the American Garden 


What we might cali the conventional aural history of industrialization in the ante- 
bellum United States rests on a basic and enduring interpretive binary that stresses 
that the preindustrial quietude of pastoral America became overwhelmed by ever- 
louder industrialism. Quiet countryside versus loud factory is certainly an intui- 
tively attractive way to think of the transition not least because it fits, almost too 
neatly, something of a whiggish teleological mándate. It also happens to bear the 
imprimatur of some key historians, most notably Leo Marx, whose seminal 1964 
study, The Machine in the Garden , has influenced an en tire generation of historians 
in the United States and, in fact, has impacted the work of scholars of hearing. For 
Marx, the shriek of the locomotive especially functioned as an actual and symbolic 
disruption of the pastoral ideal by positioning the pastoral and the industrial in ten- 
sión with the sounds of the factory, which were ruthlessly penetrating and disrupt- 
ing the countryside. For Marx, the pastoral increasingly functioned as an escape 
from the ravages — aural included — of modernity (Marx 1964; Bijsterveld 2008, 

50-51). 

While Marx’s metaphor works very well for transgressive technologies, such as 
the railroad, which literally pierced the countryside, it not only is less effective for 
explaining the experiential aspects of work in the first milis but also fails to take full 
stock of just how braided pastoral sounds were with machine sounds in those fae- 
tones. In other words, it cannot take account of that very first wave of industrializa- 
tion, a wave in which the country — in the form of rural women — carne to the 
factory. Capturing the complexity of the sounds of the first factory shop floors in 
the United States requires a careful ear, one attuned to the larger context in which 
industrialization occurred and that takes the words and life experiences of Lowell 
workers seriously and arrives at an understanding of the way that pastoral sounds 
were cobbled onto and braided with factory sounds. 

I use the term braided deliberately. No process — even one as iconic and as 
sonic as early industrialization — sweeps clean. Everyday life — lived experience — 
tends to foster attitudes that import elements of our memory in an effort to 
manage the experience of the present and cope with anticipated futures. In this 
sense, transitions are just that: processes, lived and woven. They are temporal 
fabrics, an ever-evolving cloth, replete with patterns, contours, and textures under- 
stood by contemporaries but whose threads are sometimes difficult for the historian 
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to discern. Here, listening to how early factory workers understood their experience 
helps reveal, in unusual relief, those threads, the process of the weaving, that daily 
braiding. In other words, the heard worlds of women factory workers grant us 
unusual access to the complexity of their experience. Braiding, then, functions as 
both apt metaphor and descriptor. 

Now, to what extent the experience of female workers in Lowell was excep- 
tional or unusual is a matter worthy of future research, and I can only gesture 
toward answers here. To some extent, all instances of early, first-wave industrializa- 
ron, regardless of location, involved transitions from agriculture to industry with 
machines making their way into gardens of ancient stock. Braiding, metaphorical 
and literal, was probably inherent to the process and sometimes by design. English 
steam peppered nineteenth-century Prussian gardens, at once elaborating national 
ideáis but also giving voice to the technological sublime; everywhere in the indus- 
trializing world, railroads snaked through the countryside; and the first generation 
of factories in England and the United States drew from riparian sources, nature’s 
oil greasing and powering people’s machines (Roberts 2000, 55-58; Wise 1999). It is 
also the case that even in maturing industrial societies — as was the case in late nine- 
teenth- and early twentieth-century America — rural eastern European immigrants 
found themselves thrust into powerfully and disconcertingly large and cacopho- 
nous factories. However, as (albeit limited and largely literary) evidence suggests, 
these second-generation industrial laborers, unlike their early nineteenth-century 
Lowell counterparts, found their pastoralist and bucolic metaphors challenged by 
the sheer scale, loudness, and alienating noises of some of the United States’ largest 
industrial enterprises (Sinclair 1906, 4, 29, 237). The Lowell workers, by contrast, 
entered factories at once smaller and, very probably, quieter. Certainly, these facto- 
ries were noisy, and many Lowell workers commented on the matter, often describ- 
ing the noise as oppressive and unhealthy. Critically, though, they did so in a 
cultural, economic, political, and social context that helped them braid their rural 
backgrounds with a new industrial reality. 

Here I wish to make more of the importance of gender to the shaping of the 
experience and reception of factory noise. The paternalism underwriting America’s 
first factory system was very much informed by the fact that the workers were 
women, and these imperatives led factory managers to go feminize the workplace 
somewhat both visually and, in the form of what they had the girls read, wrote, and 
learned, aurally. Many of these women were reasonably well educated, could read, 
and could draw on literary representations — as well as personal experience — to 
help frame, medíate, and represent what they actually heard on the Lowell shop 
floor. Por this reason, they held, in mind’s eye and ear, literary representations of 
sound, noise, and silence and were able to weave their actual experience in Lowell 
with their lived and read experience, thereby creating new, braided narratives that 
helped them make sense of their world. 

None of this is to say that the machine did not intrude upon the American 
garden; it certainly did. However, missing from the narrative is the way in which the 
garden became incorporated into the machine and, in the process, tattooed some of 




THE GARDEN IN THE MACHINE 


43 


the most important acoustic signatures onto the first wave of industrial life. Plainly, 
sounds did function as a metric of industrialization in the United States and at 
Lowell in particular. However, there is a great deal more to be made of that process. 
Rural sounds and factory sounds certainly did compete but not as much as the 
machine-in-the-garden metaphor leads us to believe. Instead, the way Lowell work- 
ers listened and the meaning they attached to the sounds around them suggest that 
the sounds of industrialization were not necessarily alienating or jarring (Bijsterveld 
2008, 78-79). 

To be sure, not all historians of the debate about the transition to capitalism in 
the United States have embraced the country as quiet, industry as noisy dichotomy. 
Robert A. Gross in an important article on the coming of agricultural capitalism in 
rural New England made the important point that the presence of the market revo- 
lution in rural areas — and the trains and roads and canals that facilitated it — tended 
to up a preexisting volume rather than change it qualitatively. The “new system of 
agricultural capitalism” could be heard in “the steady chopping of the ax; the bustle 
of men spading up meadows, hauling gravel, and raking hay; the clanging of milk 
pails.” Urban demand for agricultural goods in the 1840S merely intensified these 
sounds and, in this way, built on to an already existing rural soundscape. These 
were oíd sounds, sounds built upon by the distant demands of urbanizing and 
industrializing America. For the most part, according to Gross, the sounds of the 
countryside in the antebellum period remained similar in register and meaning, 
and the principal change was an increase in frequency (Gross 1982, 54). 

Historians of other regions in the early industrial North have offered helpful 
remarks on the highly stuttered transition from countryside to factory. Anthony 
F. C. Wallace’s influential “account of the coming of the machines, the making of a 
new way of life in the mili hamlets, the triumph of evangelical capitalists over social- 
ists and infidels, and the transformaron of the workers into Christian soldiers in a 
cotton-manufacturing district in Pennsylvania in the years before and during the 
Civil War,” as the subtitle of his study of Rockdale puts it, uses sound to counter- 
point preindustrial and industrial Rockdale. Wallace titles his first chapter “Sweet, 
Quiet Rockdale,” capturing, in Wallace’s ear, preindustrial (pre-i850s) Rockdale, a 
place where “There was as yet no jarring sound of locomotive engines and shrieking 
whistles.” Nascent industrialization was also tranquil. “The milis themselves,” he 
goes on, “powered only by water, whispered and grunted softly; the looms clattered 
behind Windows closed to keep moisture in the air; even when the workers were 
summoned, it was by the bell in the cupola and not by a steam whistle.” He con- 
cludes: “The machine was in the garden, to be sure, but it was a machine that had 
grown almost organically in its niche” (Wallace 1978, 4). 

Yet the larger interpretive matrix established by Leo Marx has tended to pre- 
domínate basic thinking on the history of industrialization in the New England and 
other northern States. And there is some legitimacy to that influence. In some 
important ways, the machine did, in fact, enter the American garden. This is most 
obvious with the railroad. Numerous sources from the 1830S to the eve of the Civil 
War chart the puncturing sound of the train on the putative silence and quiet of 
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the countryside. Spectators — ear witnesses — were “surprised to observe what a 
sudden change was made in the scene by the departure of the cars. A moment 
before, all had been noise, tumult and confusión. But when the sound of the engine 
died away in the distance, they found themselves left in a scene of almost entire 
silence and solitude” (Abbott 1843, 98). Here was the familiar trope of industrial 
capitalism in the form of the railroad puncturing rural quietude. 

The machine also became increasingly intrusive, so much so that the garden 
was smothered. Urban industrialization quieted the sounds of largely preindustrial, 
rural production. Rural New England workers who grew flax and processed it in 
their homes had their artisan form of craft production eclipsed by the massive fae- 
tones at Waltham and Lowell. By the i86os, the craft industry had died, and its 
death was heard. As one contemporary in the i86os remarked, “So complete has 
been the change that few persons under . . . thirty years of age, have ever . . . heard 
the buzzing of the flaxwheel” (quoted in Rivard 2002, 33). Indeed, in both the 
United States and Great Britain, the impact of centralized factory production and 
the concomitant extraction of quasi-industrial production from rural households 
was heard by an older generation, “grand-mothers” included, who, as early as the 
1830S, saw “misery and ruin cióse at hand, because the sound of the spinning-wheel 
and the loom is no longer heard in all our farm-houses” (Anón. 1832, 221). Little 
wonder, then, that the experience of the first factory workers at Lowell has been 
wrapped around the same binary by historians. Transported from supposedly 
serene, tranquil New England farms by hirers sent out from the milis to recruit 
them, they, in the words of more than one historian, “arrived in Lowell, heard the 
racket of the milis,” and returned to their “tranquil” farmsteads (Moran 2002, 1, 43). 

Part of the disruption of the pastoral ideal resided in the collateral sounds pro- 
duced by the transition, most notably the sound of urban class conflict. Paul Faler’s 
Mechanics and Manufacturen, a detailed study of the industrial revolution in the 
shoemaking industry in Lynn, Massachusetts, describes how, in the late 1850S, 
Lynn’s city officials, operating from a bourgeoisie reformist perspective, charged 
workers with “rowdyism” and equated social order and “quietness” with discipline 
and obedience. Booze and lateness caused social noise, threatening, by extensión, 
social “tranquility.” In other words, this was the sound of class conflict — which 
culminated in a strike by the shoemakers in 1860. For Faler, attentiveness to the role 
of sound and perceived noise is actually quite important in trying to capture the 
contested nature of class conflict, the formation of class consciousness, and the 
larger process of industrialization. For example, he points out how, in 1828, Lynn 
city officials passed a series of ordinances designed to prohibit what they saw — and 
heard — as objectionable. Bylaws prohibited, in Faler’s words, “insulting language 
and “making tumultuous noises.” Faler ponders these ordinances: Perhaps indus- 
trialization and accompanying urbanization in Lynn in the 1820S had increased 
noise. Nonetheless, he concludes: “It is more likely though that there had been little 
or no increase in . . . coarse language, or noise in the streets but that these practices 
had become objectionable by their incompatibility with the valúes” of the urban 
bourgeoisie (Faler 1981, 210, 118, 116-17). 
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However, is the machine-in-the garden framework necessarily helpful or even 
accurate for capturing the full range of the experience of industrialization in New 
England? In the case of the establishment and the growth of the very first factories 
in Lowell, Massachusetts, I think it less so. In fact, and as I argüe, the experience of 
the first factory operatives at Lowell was not just about the auditory history of the 
machine domineering the garden but also had a great deal to do with how the larger 
context in which the Lowell workers existed led the garden into the machine. 


Lowell: The Garden in the Machine 


Although it is well known, a brief social history of Lowell and especially of the living 
and working conditions in the factories is instructive not least because it helps pro- 
vide the context necessary for understanding the ways in which the workers braided 
their experience with the countryside with their new factory lives. Context — both 
specific and general — is key for determining how and why people heard as they 
did — for determining the meaning contemporaries attached to particular sounds. 
A quick architecture of the Lowell women’s lives allows us access to that context. 

The “first efforts to promote manufactures in this place, were made in 1813,” 
courtesy of the War of 1812, Thomas Jefferson’s Embargo Act, and efforts by 
Americans to wean themselves firom British manufacturing. Beginnings were 
modest, as most beginnings are. Initial efforts focused on woolen textile production 
and the acquisition of water rights to expand factory production. In the 1820S, more 
substantial milis dedicated to cotton textiles emerged, and the register of expansión 
can almost be heard in the statistics. In 1820, the population of Lowell was roughly 
200 souls; in 1828, it stood at 3,532; in 1833, 12,363. By 1840, the population was 
20,981. By the 1830S, the city had ten textile corporations that ran thirty-two milis. 
By 1840, the factories employed a total of about eight thousand workers, and the 
majority of them were women aged sixteen to thirty-five (H. L. 1843, 146-47). 

The “Lowell system” was a delibérate attempt to blend large-scale industrial 
manufacturing with the educational and moral improvement of its female work- 
force. The average female operative was twenty-four years of age; some arrived at 
Lowell with mothers or sisters; most were hired on annual contracts; and, on aver- 
age, the women stayed four years at Lowell. At work, the newer, younger arrivals 
were paired with older and more experienced female workers, who trained them on 
the job and also in the ways of Lowell. Male overseers held authority in the shop or 
room, each of which averaged eighty female workers. They worked from five in the 
morning until seven at night, averaging seventy-three hours a week. After work, the 
women were required to sleep and live in a series of female-only boarding houses 
located on the mili grounds. They were expected to attend church, behave in a 
moral fashion, and attend lectures sponsored by the Lowell factory owners (Dublin 

1975 > 1979). 
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In other words, industrial paternalism underwrote Lowell from beginning to 
end. The labor forcé — on average about three-quarters female — was designed to be 
temporary. Indeed, rural women were recruited by men who told tales of high 
wages and good, moral living conditions. Part of the reasoning here was to avoid 
the development of “ignorance” and “depravity” among workers, which, courtesy 
of the English (and, to some extent, the French) experience, was deeply associated 
with industrialization in New Englanders’ minds. a As long as our milis are wrought 
by operatives from the country, or from the common schools of Lowell, they will 
not be filled with a depraved and ignorant class.” That a “great preponderance” of 
the workforce was composed “of a youthful female population” animated efforts to 
institute habits and training designed to sidestep perceived depravity and also make 
the young women into good, conscientious workers. Lowell girls were, therefore, 
encouraged to become involved in religious and charitable societies, read and write, 
and improve their moral character while also learning the ropes of industrial labor 
and earning what was considered a good wage. When all was said and done, sup- 
porters of this system believed that, viewed as a whole, Lowell “might almost lead 
observers to believe that our hard-working, matter-of-fact city had been trans- 
formed to fairy land” (H. F. 1843, 147-49). 

In stressing the Lowell experience and the specific context of paternalism, I take 
my cue from Herbert Gutman, an eminent U.S. labor historian. In his seminal 1973 
article, “Work, Culture, and Society in Industrializing America, 1815-1919,” Gutman 
examined the rate, nature, and intensity of laborers’ resistance to the coming of the 
industrial order. Like E. P. Thompson before him and Jonathan Prude after him, 
Gutman stressed the process of industrialization and placed the workers’ experi- 
ence at center stage. Different generations of American workers, explained Gutman, 
accommodated or resisted industrialization depending on a number of factors, 
including the scale of manufacturing and the policies of industrial capitalist and 
factory managers. For our purposes, it is important to note that Gutman elected to 
examine the Lowell milis as typical of early industrialization. As Gutman makes 
clear, not all was roses. When Lowell began to rationalize production after 1840, 
Lowell women engaged increasingly in collective protest and were eventually 
replaced by Irish immigrants. He is also careful to point out that The Lowell Offering , 
a magazine written by the Lowell women themselves (although it was funded and 
produced by LowelLs factory owners), is a helpful source for detecting criticism of 
the factory, if read carefully. “Historians have dismissed it too handily,” says 
Gutman. It was much more than a factory-owner mouthpiece. Careful reading 
reveáis expressed dissatisfactions embedded in prose and poetry. Beyond their crit- 
icism of work speed-ups and decreasing real wages, Gutman detects in these writ- 
ings the women’s “attachment to nature . . . the concern of persons working 
machines in a society still predominantly ‘a garden.’ ” Certainly, the women’s 
expectation that they would work at Lowell only temporarily (they had no inten- 
tions of becoming a permanent proletariat) eased their transition to factory life and 
helps account for the relative absence of protest, according to Gutman. However, 
that relatively smooth transition also had a lot to do with the ways that the factory 
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owners and the workers themselves braided pastoral concerns with factory life. 
Sound and modalities of listening occupied an important place in that transition 
(Gutman 1973, 552-53). 

Of course, there is no doubt that the Lowell workers found aspects of the milis’ 
soundscapes alienating, deadening, and disruptive. For the Lowell women, there 
were certainly acoustic hurdles. One was simply vocal. Young women fresh from 
the countryside heard their own accents, their a up-country” twangs accentuated 
now that they spoke in the context of more voices. Some found the competing 
brogues mildly disconcerting, while many attempted to modulate their dialect to 
better fit in with the workforce (in Eisler 1977, 4 6). 

Also, as Gutman suggested, the Lowell authors were not unknown to vein their 
miscellaneous offerings with subtle critiques of their industrial world. Occasionally — 
for they were relatively rare — such critique carne in the form of detangling the 
sounds of nature from the noise of industry, setting the two as antagonistic and 
irreconcilable. “How beautiful, and yet how striking, is the thought, that the most 
wonderful operations of nature are effected without the noise and tumult attendant 
on the works of man,” mused S. J. H. in 1842. Human “vanity,” cc pomp, and show” 
were mere a noise” compared to “Nature’s plan, And silent work” (S. J. H. 1843, 102). 

Other mili girls expressed their angst in poetry: 

And amidst the clashing noise and din 

Of the ever beating loom, 

Stood a fair young girl with throbbing brow, 

Working her way to the tomb. (quoted in Moran 2002, 42) 

Indeed, the alienating effect of industrialization was also tackled by contempo- 
rary labor reformers through an appeal to sound. In 1846, factory operatives in 
Philadelphia urged the adoption of a ten-hour working day by depicting the ener- 
vated, exploited worker as a near-dead automaton, “listening in despondency to 
hear, through the clattering of machinery, the hour of his delivery from toil” (quoted 
in Wallace 1978, 389). 

Beyond this, though, there are very particular reasons that the sounds of Lowell 
are better understood as being consonant with the sounds of rural New England. 
There is little doubt that female workers in Lowell expected their new environments 
to be qualitatively and quantitatively different from their villages, but their experi- 
ence upon arrival in the factory town suggested that the factory boosters and the 
rumors had exaggerated that difference. Take the experience of “Susan,” a typical 
Lowell girl. She traveled from her rural village in New Hampshire and, soon after 
arriving in Lowell, wrote home telling her family her impressions of the place. 
“I waited, one day, to see the cars come in from Boston,” wrote Susan. She was, 
though, underimpressed: “They moved, as you know, very swiftly, but not so much 
like a ‘streak of lightning’ as I anticipated.” Moreover, she mused, “If all country 
girls are like me their first impressions of a city are far below their previous concep- 
tions, and they think there is more difference than there really is. . . . I see that the 
difference is more apparent than real” (in Eisler 1977, 49). Indeed, it should be 
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remembered that industrialization and industrial progress did not always equate 
with louder machines. Quieter leather belts replaced noisy gearing beginning in the 
1820S in some factories in Massachusetts and Rhode Island, for example (Rivard 
2002, 58). 

Many of the workers mediated Lowell through a pastoral idiom simply because 
that idiom was the prevailing one at the time. The mili women’s ability to braid the 
soundscapes of rural life and the sounds of the shop floor and thereby make sense 
of their transition was due, in no small part, to other highly valued cultural assump- 
tions about the importance of nature and God, a movement known as Román ticism. 
While certainly a national experience, Romanticismos roots were planted firmly in 
New England, and it emerged in the context of the long are of the American 
Enlightenment. A complicated and highly varied movement, Romanticism tended 
to read nature and God into the world partly in reaction to excessive materialism 
and industrialization and partly in reaction to what was perceived as a failed 
Calvinism. While it is certainly true that some of the most active literary advocates 
of Romanticism — most notably the Transcendentalist Henry David Thoreau — 
grimaced when they heard industrialization, even they were not above hearing 
nature in modernity. Thoreau heard the telegraph as “faint music in the air like an 
Aeolian harp” and understood “the whistle of the steam-engine” as “arousing a 
country to its progress” (quoted in Smith 2001, 127). Beyond Thoreau, Romanticism 
was broadly shared, and the Lowell women were not only a part of it but integral to 
it as well. Page after page of The Lowell Offering shows that the Lowell women 
believed God and nature were ubiquitous. cc His ñame is written indelibly upon 
every particle of created matter,” the factory floor included. u [N]oisy” grandeur” 
could be heard in nature, especially in rivers and waterfalls, just as could the “soft 
music” of “whispering winds” and other “silent accents” (Annaline 1842, 32-33). 

Other Lowell workers understood the sounds, the smells, and the experience of 
working on the shop floor in distinctly religious and natural terms: 

In the milis, we are not so far from god and nature, as many persons might 
suppose. ... A large and beautiful variety of plants is placed around the walls of 
the rooms, giving them more the appearance of a flower garden than a workshop. 

It is there we inhale the sweet perfume of the rose, the lily, and geranium; and, 
with them, send the sweet incense of sincere gratitude to the bountiful Giver of 
these rich blessings.” (in Eisler 1977, 64) 

The Lowell Offering is replete with examples of mili women braiding the sounds 
of rural life with the registers of industrial work. The Lowell women carne to under- 
stand the noises of the factory — which were very loud and, superficially at least, 
wholly alien to the soundscapes of rural new England — in the context of remem- 
bered pastoral sounds. Some suggested that the soundscape of “The Power of 
Industry” was not altogether different from the heard worlds of life back on the 
farm. “The hens’ constant cackling” was indicative of “the Spirit of Industry,” their 
product, “two eggs a day.” Village life when “Every body was busy” was heard in “an 
incessant cackle” (Kate 1842, 27). “[W]hen I went out at night,” wrote one Lowell 
worker in 1844 after a day in the cacophonous mili, “the sound of the mili was in 
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my ears.” She compared the clanking of metal, the rush of belts, and the slamming 
of the looms to her aural past. The sounds of the mili were “as of crickets, frogs, and 
jewsharps, all mingled together in strange discord.” Possibly suffering from tinnitus 
(the author’s reference to crickets is especially suggestive), the register was certainly 
different, and the sounds of the factory lacked nature’s harmony, to be sure. Still, 
the referents were pastoral as, in fact, they had to be. More than that, though, the 
pastoral metaphor functioned to ease the transition to factory labor. “Susan” went 
on to explain that after a day in the mili, “it seemed as though cotton-wool was in 
my ears, but now I do not mind at all. You know that people learn to sleep with the 
thunder of Niagara in their ears, and a cotton mili is no worse, though you wonder 
that we do not have to hold our breath in such a noise.” Susan reached into the 
loudest sound of rural New England — the roar of Niagara — and made sense of her 
new aural environment on the factory floor (in Eisler 1977, 51-52). 
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Even the dreaded clanging of the factory bell was rendered comparable to farm 
Ufe. In their literary offerings, the working women of Lowell played out the tensión 
between farm and factory life and tried to make the connection seamless. “ c What 
difference does it make,’ said I, c whether you shall be awakened by a bell, or the 
noisy bustle of a farm-house? For, you know, farmers are generally up as early in 
the morning as we are obliged to rise. 5 55 Enter the counterargument, given by 
“Ellen.” “ c But then,’ said Ellen, ‘country people have none of the clattering of 
machinery constantly dinning in their ears.’ ” “ c True,’ I replied, c but they have what 
is worse — and that is, a dull, lifeless silence all around them. The [hens] may cackle 
sometimes, and the geese gabble, and the pigs squeaP ” (in Eisler, 1977, 162). 

There were good structural, physical reasons behind the Lowell workers’ incor- 
poraron of industrial noise into pastoralism. That mili workers should have under- 
stood the sound of the factory in the idiom of nature’s sounds is hardly surprising 
because early — and, in fact, some later — New England’s milis were literally con- 
nected to and dependent on waterpower. The machine was literally in the riparian 
garden. In April 1843, “H. F.” offered a description of Lowell in which nature and 
industry played handmaidens. Water figured prominently in the braiding. “The 
city of Lowell stands upon the Merrimack river; upon a point of land, formed by the 
Concord river, at its confluence with the Merrimack, and a bend in that river, from 
which its direction is at a right angle with its former course. It is intersected by 
many canals, the principal of which is the Pawtucket . . . thus forming an island of 
the city; it being entirely surrounded by water” Certainly, the making of Lowell was 
a noisy affair with “Five hundred men . . . constantly employed in digging and blast- 
ing” to carve out the canals, but the effort was worthwhile: The labor had effected a 
quite “sudden . . . transition from the monotony of a quiet village to the hurry and 
bustle of a manufacturing city” (H. F. 1843, *45> H 7> see also Rivard 2002, 33-38; 
Lewis 1844, 242). However, the specific technology underwriting early New England 
milis — waterpower — helped in the mili women’s articulation of rural, natural 
sounds with the sounds of factory life. “Directly below my window passes the com- 
bination of nature, and human invention, forming a canal,” wrote one Lowell 
worker in 1842. Water powered this “American Manchester,” and the customary 
and traditional sound of rushing torrents could be heard in the milis, less in terms 
of the sound of the torrents themselves but more in the noise of the machinery 
powered by that same water (M. T. 1842, 37). While Lowell was louder than it 
had once been, its volume and soundscape were not necessarily in tensión with its 
audible past. 

Paternalist imperatives led factory owners to bring the garden to the machine. 
According to Susan in 1844, her experience on the factory floor was hardly pleasant — 
the hours were long, the warm rooms could be stifling, and the constant standing 
while tending the loom “makes my feet ache and swell.” Yet, taken as a whole, the 
experience was located at an intellectual, emotional, and even physical intersection 
between nature and machine. Susan considered the factory floor “Very pleasant . . . 
light, spacious, and clean . . . the machinery so brightly polished or nicely painted.” 
Space and light: both reminiscent of the countryside. Moreover, “The plants in the 
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Windows, or on the overseer’s bench or desk, gave a pleasant aspect to things.” The 
garden was literally in the machine room, water and all: “The dressing-rooms are 
very neat, and the frames move with a gentle undulating motion which is really 
graceful,” not unlike the undulations of water (in Eisler 1977, 51-52). 

But what was the valué of the garden in the machine if the machine was so 
loud it damaged the intellect? Factory operatives writing in The Lowell Offering 
also went to some lengths to deflect the criticism that the noise of factory produc- 
tion was necessarily inimitable to the life of the mind, a charge that had a deep 
genealogy and one that, as John Picker has shown, was championed by intellectuals 
at the end of the nineteenth century (Picker, 1999/2000; Agar, 2002; Barón, 1982). 
However, the factory women of Lowell maintained not only that the vita contem- 
plativa was unsuppressed by factory noise but also that the soundscape of industrial 
labor could actually benefit thinking. Reject solitude, moved a Lowell Offering writer 
in 1843, “for we are social creatures, creatures of sympathy.” Thinking, godliness, 
and spirituality were not achieved alone in nature, surrounded by “the low flutter- 
ing breeze . . .the faint murmur of the brook, the loud roar of the cataract — or 
even the rich music of the birds — these are nothing more than unmeaning, 
senseless sounds,” sounds that, experienced alone, fall on a “senseless ear.” Sound 
in social context was far more productive of social “harmony,” populating social 
space “where the current of thought could have free circulation” (Adeline 1843, 
162-63). 

Neither did entering the social world — even if it was the world of the factory 
floor — detract from the valué of thinking. Indeed, factory noise could be overeóme 
and even help. “Strangers appear to think that the noise which affeets them so 
severely and unpleasantly, must cause the same sensibilities in all who hear it,” read 
a Lowell Offering editorial. “They forget the power of habit, the benumbing influ- 
ence of every constant action of sensation. Those who live within hearing the dash 
of Niagara, become insensible to its deafening roar; and, amid the clatter of wheels, 
bands, and spindles, the still small voice within may be as plainly heard as in the 
chamber’s solitude, or when beneath a midnight sky.” “In truth,” the writer contin- 
ued, “the factory is rather favorable than otherwise to reflection. We become uncon- 
scious of the machinery’s din, and it completely deadens every other sound.” So 
much so that “the best articles in the Offering we know to have been composed in 
the mili” (Editorial 1843, 164). 

As one worker explained: “In the sweet June weather I would lean far out the 
window, and try not to hear the unceasing clash of sound inside. ... I discovered, 
too, that I could so accustom myself to the noise that it became like a silence 
to me. And I defied the machinery to make me its slave. Its incessant discords could 
not drown out the music of my thoughts if I would let them fly high enough” 
(quoted in Moran 2002, 21). Noise certainly precluded chatter among the opera- 
tives: “But, aside from the talking, where can you find a more pleasant place 
for contemplation?” Labor focused the mind, the división of labor freed it to think, 
and the noise did not intrude (in Eisler 1977, 63). More than that, managing 
the noise in order to think was imperative: “While at her work, the clattering and 
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rumbling around her prevent any other noise from attracting her attention, and 
she must think , or her life would be dull indeed” (in Eisler 1977, 66). This was 
the lived experience of the first generation of factory operatives, one in which 
sounds of the factory floor, although objectively loud, discordant, and noisy, were 
rendered manageable through prevailing New England idioms stressing nature, 
God, and mind. 


The Sectional Sounds of Lowell 


If Lowell sounded like God and nature and empowered the intellect, another factor 
made LowelEs noise not only manageable but even attractive. That was the sound 
of sectionalism and, more generally, the soundscape of a particular mode of pro- 
duction (that modes of production have particular soundscapes is clear from other 
work; see Braun, this volume, Chapter 2). Although they did not phrase it in such 
formal terms, antebellum Americans, North and South, listened to their respective 
modes of production, processing, mediating, constructing, and shaping their 
respective identities through what they heard, literally and figuratively, about one 
another. The mode of production in the antebellum South — one premised on the 
use of enslaved labor — had to northern ears increasingly accustomed to hearing a 
distinctly free-wage-labor society its own particular — and worrying — soundscape. 
Northern ears heard the voice of democracy, the hum of industry, and the hum 
of economic progress in their own society; in Southern slavery, they heard disturb- 
ing quiet, the silence of economic backwardness, and the muffling of slave voices 
(Smith 2001). 

Lowell especially was the vanguard of northern modernity and stood in stark 
visual — and aural — contrast to Southern slavery. It was the great experiment of the 
early American republic, northern style. The Lowell women existed within an 
emerging capitalist mode of production that fashioned itself in part acoustically, 
adopted its own specific aural metaphors, and carne to counterpoint itself to an 
at-once silent but scream-ridden slave South that refused to industrialize on the 
free-wage labor terms being sponsored in Lowell. 

It was this aural counterpoint against a silent, quiet, and presumably economi- 
cally backward, largely rural, and precapitalist slave South that led northern factory 
owners to hear not din but progress and harmony in their industrial ventures 
(see also Braun, this volume, Chapter 2). In some important respects, the tenor of 
factory life was heard because its counterpoint — silence — was too reminiscent of 
stagnation and, especially, death, all of which were associated with Southern slavery 
(J. S. W. 1842, 106; Smith 2001, 150-71). 

Many observers made this point, including the abolitionist Ebenezer Davies 
while steaming along the Kentucky-Ohio border in the 1840S. Davies let his ears 
do the telling. To his left stood slavery, a place half asleep, “slaves loitering in 
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half-desert fields.” To the north, from Ohio, “a confused hum is heard which pro- 
claims the presence of industry . . . and man appears to be in the enjoyment of that 
wealth and contentment which are the reward of [free] labour.” Others agreed that 
from the South “the sounds of the steam engine and the manufactory rarely falls 
[sic] upon his ears” (Davies 1849, 177; Anón. 1850, 46). 

For their part, northern sponsors of industrial capitalism stressed the silent role 
of capital in the creation of what they styled the “hum of industry” (Smith 2001, 
129). Paternalists such as William D. Haley, pastor of the First Congregational 
Church of Alton, Illinois, offered a “Series of Lectures to Workingmen, Mechanics, 
and Apprentices” in 1855 and warned them of the dangers of labor unrest by point- 
ing to aurality. Without capital, maintained Haley, quoting “one Dr. Boardman,” 
“All the businesses which now rest on a credit basis would cease. Not a hammer 
would be heard in the ship-yards. The silence of death would replace the intolerable 
but productive clatter of the foundries and machine shops.” You might not enjoy 
the noise of industrial capitalism, Haley told the workers, but you will loathe the 
silence of economic recession (Haley 1855, 31). Moreover, the sound of industry (as 
opposed to its noise) was a fixed register with similar meaning regardless of coun- 
try. In his celebratory Memoir of Samuel Slater , George White offered an account of 
“the machinery in French milis,” which echoed, quite precisely, the conventions 
used to tout American progress. The “perpetual din” was powerful testimony to 
France’s economic progress (White 1836, 341). 

But boosters were careful not to go too far. As much as they wanted to avoid the 
silent perils of slavery, they also proved anxious to fall short of what they under- 
stood as excessive industrialization. From the virtue of the hum of industry, indus- 
trial capitalists pivoted seamlessly to the argument that workers nonetheless were 
afforded quiet when they needed it, especially when it carne to the quiet necessary 
for moral instruction. In fact, American boosters nationalized the association, 
claiming that their paternalism, their American system, was superior to that of 
England’s especially. Their mili houses and Sunday schools were “quiet fortresses” 
where “the operatives with their families pass the tranquil tenor of their lives” 
(White 1836, 173, 223). 

Even in the faetones, capitalists made the case that while the sound of machin- 
ery was inherently jarring, workers earned aural respite. Sometimes they even gen- 
erated it and thereby fed the paternalist conceits of managers. Samuel Slater wistfully 
recalled the following about a factory floor in Falls River, Massachusetts: 

I shall always recollect with pleasure one little incident in one of the weaving 
rooms of the manufactory, where the noise was very distracting, arising from a 
vast number of looms going at once. The machinery suddenly stopped, and a 
strain of music aróse simultaneously from every part of the room, in such perfect 
concord that I at first thought it a chime of bells. My conductor smiled when I 
asked him if it was not, and pointed to the girls, who each kept their station until 
they had sung the tune through. (White 1836, 235) 

In this way, industrial capitalists tried to establish, through aural metaphor, 
what they wanted most: a mode of production stressing the harmony of labor 
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and capital. Sound was central to that connection, and this is what they meant by 
the cc hum of industry”: the sound of compliant labor and faetones, the former 
refraining from introducing the noise of social dislocation and labor unrest into the 
soundscape of economic progress. This is why in Lowell he could hear “the peaceful 
hum of an industrious population, whose movements are regulated like clockwork” 
(White 1836, 47, 238; Moran 2002, 15). 


CONCLUSION 


While we should remain skeptical of capitalists’ claims concerning the putative har- 
mony of labor and early industrialization, it is nevertheless true that workers 
accepted Lowell’s noise precisely because they were the sounds of freedom and not 
those of slavery and because they were framed within a larger matrix of paternalism 
and Romanticism. This is precisely what Lucy Larcom, perhaps the most famous of 
the young women of Lowell’s faetones, meant when she wrote that, having returned 
from the countryside to the mili, “I found that I enjoyed even the familiar, unremit- 
ting clatter of the mili, because it indicated that something was going on” (in Larcom 
2007, 88). There was, to be sure, class conflict in the North, but workers — at least 
the earliest generation — and capitalists agreed on their preferred soundscape, and 
there was impressive unity in that aurality. 

Certainly the context in which the factory women of Lowell were made workers 
was one in which a narrative of sound-as-progress prevailed, courtesy (at least in 
part) of capitalists who themselves were making the transition from rural to urban, 
industrial society. By pointing out that Lowell workers mediated their transition 
from rural to factory life through the aural idioms of religión, nature, and the 
emerging capitalist mode of production is not to render them mere dupes to the 
principal authors who described and promoted these developments. Rather, it is to 
suggest that they were active participants in the same processes, their voices con- 
tributing in important ways to those transitions. If we listen to them listening, we 
begin to get a deeper, more complex understanding of the role that sound and noise 
played in the early years of American industrialization. The transition to industrial- 
ism did not necessarily come with a bang, no matter how noisy, alien, and new the 
sounds of the shop floor were at first. Rather, it is more profitable and telling to 
understand why and how the first generation of American factory workers man- 
aged to medíate the new sounds and turn noises into registers they found palatable 
and consonant. 

The valué of attending to context in our efforts to deploy and capture the his- 
tories of sound and listening, broadly construed, should also be understood in 
terms that reach beyond specific historiographies. As historians of the auditory 
world begin increasingly to accept invitations by their colleagues in museums and 
historie preservation to help them use sound to edúcate the general public, we will 
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need to think carefully about the ways in which we can be helpful but reliable inter- 
preten of the heard past. After all, the curatorial tendency is to attempt to re-create 
the sounds of buildings and factories in an effort to generate public interest and 
heighten education. And this is certainly laudable. Nonetheless, as we enter these 
conversations and advisory roles, we should do our best to stress the importance of 
context to the history of sound. And therein is the challenge. Context changes 
everything, and it is extremely difficult to convey its preeminent valué to museum 
patrons. The context in which the sounds are being produced — even if those sounds 
remain the same — alters meaning and is likely not only unrecoverable but possibly 
beyond communication in mere aural form as well. Historie preservations and 
museum curators will need to provide context lest visitors to, say, the Lowell milis 
walk away with the impression that the milis were simply loud. Or roaring. Or buzz 
ridden. Yes, they certainly were that. All the same, leaving the description and expe- 
rience at that does enormous violence to the way that the actual workers under- 
stood, mediated, and carne to manage those same sounds (Smith 2007; Matthews 
2007). If historical acoustemology is to fulfill its promise outside of the academy, we 
will need to think creatively within its walls. 


NOTE 


1 http://www.lowell.com/museums.boott-cotton-mills-museum. This chapter benefits 
materially from insights offered by Michael Bull, Karin Bijsterveld, Myles Jackson, Norm 
Hirschy, and fellow contributors to the volume. A special thanks to David Prior for 
invaluable help with the research. 
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CHAPTER 2 


TURNING A DEAF 
EAR? INDUSTRIAL 
NOISE AND NOISE 
CONTROL IN 
GERMANY SINCE 
THE 1920S 


HANS-JOACHIM BRAUN 


Introduction 


The title of this chapter, “Turning a Deaf Ear?” sounds catchy, of course, and is 
rather ambiguous . 1 In dealing with industrial noise and noise abatement in the 
twentieth century, however, this title also serves as a sort of leitmotiv because, at 
least before the 1970S, industrial noise abatement was not very popular. Employers 
and employees turned a deaf ear to this noise not only in a metaphorical sense but 
also in a literal sense because many employees were already deaf or hard of hearing 
because of long-term exposure to industrial noise. 

Although today almost everyone in the industrialized world will identiíy noise 
as a major problem, it is surprising that so little has been published on its history. 
Whereas several publications treat the history of “environmental” or “public noise” 
and noise-abatement measures, particularly as regards means of transport such as 
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automobiles and airplanes, historians have largely ignored the issue of industrial 
noise. Several case studies discuss various countries and periods, but the only com- 
prehensive survey on this topic is Bijsterveld (2008). In addition, several smaller 
surveys cover different aspects of the topic such as measuring noise (Thompson 
2002; Bijsterveld 2008) and noise legislation (Wiethaup 1967; Koch 2002). Aside 
from Bijsterveld’s work and a study by Mark M. Smith (2001) on the nineteenth 
century, however, there is a conspicuous lack of information on what happened on 
the shop floor. To what extent, for instance, did employers or workers take advice 
from doctors and industrial psychologists seriously? How did specific recommen- 
dations by insurance organizations and governmental regulations influence noise 
concerns? 

This chapter focuses on industrial noise and noise abatement in three different 
regimes: National Socialism during the Third Reich, socialism in the Germán 
Democratic Republic, and (Western) democracy in the Federal Republic of 
Germany. It is mainly based on contemporary journal articles, on árdeles in com- 
pany journals, and on interviews. My argument reveáis that the political economy 
within these regimes played a large role in how they dealt with unwanted sound. 
Questions addressed inelude the following: To what extent did ideology influence 
or even determine legislation on noise abatement? What was the relationship 
between theory and practice? Was employees’ resistance to hearing-protection 
devices completely irrational, or did it make sense? Part of the argument explores 
the use of music on the shop floor in the three Germán cases, also as compared to 
such practice in Great Britain. 

The phenomenon of work-related noise dates back at least as far as Román 
antiquity. A major way of tackling this problem was zoning. For example, copper 
millers were not permitted to set up shop in a Street where a scholar lived. In early 
modern Europe a similar approach was taken: Noisy handworkers were required to 
perform their activities either in a special quarter of town or outside its borders 
(Bijsterveld 2008, 56). 

The Prussian General Trade Code ( Preufiisches Allgemeines Landrecht ), which 
became binding for the North Germán League in 1871, required workshops and 
faetones to obtain a license. From 1895 onward, at the height of modern-day indus- 
trialization, conditions became stricter: Forges had to put in double walls, double 
doors, and double Windows. Generally, industries could be prohibited or restricted 
when they severely disturbed the peace of churches, schools, hospitals, or other 
public institutions (Bijsterveld 2008, 61). 

In Germany it was common legal practice to legitimize new noise on the 
basis of existing noise, and this made it difficult to change matters substan- 
tially (Bijsterveld 2008, 67-69). In the early twentieth century, there were antinoise 
campaigns in the United States, Germany, and other countries that were mainly 
directed against Street noise rather than industrial and occupational noise. In most 
cases, however, these campaigns had little success (Thompson 2002, 115-30, 
157-68). 




6o 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


Industrial Noise and Noise 
Abatement in Germany in THE 
Late 1920S and the Third Reich 


Immediately after World War One, the issue of noise was still neglected. Even 
during the mid-i920S, the “golden years” of the Weimar Republic, industrial 
firms concentrated on rationalization and increasing efficiency instead of on 
noise abatement (Braun 1990, 46-52). In November 1927, however, after pressure 
from trade unions, physicians, and others, the Germán Society for Industrial 
Hygiene began to investigate the issue of health damage by industrial noise. It 
established a “Committee for Industrial Noise Abatement,” made up of employ- 
ers and representatives from the fields of Science, technology, medicine, and public 
administraron. This committee put together a memo with various recommenda- 
tions regarding the implementation of noise-reducing devices or work processes, 
but not much was done with them. Next, in August 1928, the Technical Commission 
of the “Committee for the Fight against Hearing Damage Caused by Industry” was 
founded, but it soon became clear that business interests, such as the high cost of 
industrial noise-abatement measures, prevailed (Braun 1998, 255-56). 

In the context of Germany’s Second Occupational Diseases Regulations 
of 1928, hearing damage was recognized as an occupational disease, which 
implied that social security organizations had to compénsate workers. This resulted 
in several industrial noise-abatement measures, but they were implemented in 
the metal industry only. Furthermore, the professional organization of Germán 
engineers, the Verein Deutscher Ingenieure (VDI), also tackled the problem 
of industrial noise, dealing mainly with issues of physics and technology. In 
November 1930 the VDI set up a commission for abating industrial noise 
that cooperated closely with the Berlin Heinrich Hertz Institute for Vibration 
Research and its president, Karl Willy Wagner, a well-known specialist on 
acoustics who had cióse contacts with the Bell Laboratories in the United States. 
In cooperation with the Germán firm Siemens, he designed a new noise meter 
based on a reproduction of the human ear, which was superior to the Barkhausen 
noise meter widely used in Germany. Although critical of National Socialism, 
Wagner did not voice his reservations in public. He became Germany’s most prom- 
inent advócate of the fight against industrial noise and was appointed chairman 
of the relevant Association of Germán Engineers’ Commission (Braun 1998, 
256-58). 

Noise abatement was a big topic in National Socialism, at least in its propa- 
ganda. The Führer himself claimed that for victory in the struggle for existence 
he needed people with strong nerves (Wagner 1935; Bijsterveld 2008, 130-31). Noise 
weakens nerves. This view, which was readily taken up by some leading 
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Nazi propagandists, seemed to underscore the issue that noise abatement was 
perfectly suited for propaganda purposes. The Nazi office for “Beauty of Labor” 
(Schónheit der Arbeit) — part of the Nazi organization “Strength through Joy,” 
which in turn belonged to the Germán Labor Front — put in a major effort to 
bring home the topic of noise to Germán workers (Welch 2002). One of the reasons 
was that many Germán workers who had formally supported the now illegal 
Communist party or the Social Democrats were still skeptical of National Socialism. 
It was believed that success in foreign policy or on the “home front” would 
lure them away from left-wing ideas. Much like the communists, the National 
Socialists promised a “classless society,” and in the Germán Labor Front employees 
and employers joined hands to work for the common good — for the “people’s 
community.” 

From 1934 onward, different campaigns aimed at satisfying workers’ demands 
for a better and safer workplace were launched. The 1935 “Fight against Industrial 
Noise” campaign was followed in 1936 by the “Good Light, Good Work” campaign 
and in 1938 by one on “Fresh Air in the Workplace.” In the 1930S several journals 
reported on noise-abatement measures in industry. These included not only 
so-called primary measures, geared to the design of machinery in a “noise-friendly” 
way or to the replacement of hammering by rolling in metal working, but also sec- 
ondary measures, aimed at noise reduction through encasement or encapsulation 
(Hasse 1939). 

From April 7 to April 13, 1935, a noise-abatement week took place in Germany 
(Friemert 1980). Although there had been much propaganda in advance, its orga- 
nizers soon found out that their job was far from easy. The “Beauty of Labor” office 
had failed to compile a survey of noise-abatement measures in Nazi Germany, and 
it soon became clear that the legal basis for protecting factory workers from indus- 
trial noise was rather weak (Friemert 1980, 133). One could merely appeal to factory 
owners to impro ve shop-floor conditions, but the office itself was not authorized to 
issue regulations. If the noise-abatement week did alert workers to the problem of 
industrial noise, partly with the help of mass-media coverage, the project’s impact 
was small. One of the underlying reasons was that it proved difficult to visualize the 
issue of noise (Friemert 1980, 135). 

A few years later, in 1938, the Association of Germán Engineers and the chair- 
man of its noise-abatement committee, Karl Willy Wagner, in cooperation with the 
Ministry of Labor and the Germán Society for the Protection of Workers, founded 
the Committee for Noise Abatement in Industry (Hasse 1939). Together with the 
Museum for Industrial Hygiene they organized a conference on noise abatement in 
industry that was geared to the interests of engineers, engineering designers, factory 
managers, and officers of employers’ liability-insurance associations. Although the 
specialists’ discussions seemed to have been lively, practical effects were negligible 
because the Germán government gave priority to other issues, including increased 
armament efforts (Friemert 1980). 



6 2 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


Masking Industrial Noise and 
Increasing Productivity: “Music while 

You Work” 


Several recent publications have focused on the use of music to influence consumer 
behavior in retail stores, but so far the role of music at work has been little studied 
(but see Bijsterveld 2008, 81-87). As early as 1915, Thomas Alva Edison used a pro- 
grammed selection of phonographic music for factories to find out to what degree 
it could mask hazardous drones and boost morale, but early loudspeaker transmis- 
sion technology was still too weak for this purpose (Lanza 1994, 13; Thompson 2002, 
143). After experiments on the use of music in an architectural drafting room in 
1921, American psychologist Esther L. Gatewood argued that familiar, instrumental 
music influenced productivity in a positive way (Gatewood 1921; Uhrbrock 1961). In 
their widely quoted 1937 study, British researchers S. Wyatt and J. N. Langdon 
argued that music diverted the mind from monotonous working conditions, made 
time pass more quickly, and created a more cheerful attitude toward work (Wyatt 
and Langdon 1937). 

In this context the BBC developed its “Music while You Work” program, which 
was broadcast from June 1940 onward in two daily half-hour editions at 10:30 a.m. 
and 3 p.m., with a third half-hour program at 10:30 p.m. for night-shift workers. 
Numerous tests were done to find out more about the right kind of music to play. 
These tests showed that the music most appropriate for both masking noise and 
making people working in factories feel happy had to aim for consistency in rhythm, 
volume, or tempo. The emphasis was on “bright, cheerful music.” Vocal music 
counted as distracting and could even affect discipline negatively. “Music while You 
Work” producers optimized their broadcast in order to mask factory noise 
(Korczynski and Jones 2006). 

Masking industrial noise with music was also a topic in the Third Reich. The 
key researcher in this field was Rudolf Bergius, a psychologist whose 1939 doctoral 
dissertation dealt with experimental studies on “Distraction from Work by Noise 
and Music and Its Structural-Typological Context” (Wehner 1992). Familiar with 
Anglo-Saxon research, Bergius reported on the outcome of experiments that 
showed that work performance improved with rhythmic noise, whereas it deterio- 
rated when noise was arhythmic. He argued that Germanic people were more sus- 
ceptible to noise than others, particularly Romanic people. Life in Southern countries 
was generally louder than in northern ones. Contrary to the Romanic race, which 
tended to be extroverted, Germanic people were generally introverted and had 
more problems coping with noise (Bergius 1939). 

In the late 1930S several industrial firms in Germany used music on the shop 
floor to increase workers’ productivity. However, there was still no formal program 
similar to the British “Music while You Work” broadcast. Bergius claimed to have 
found out that in cases in which music in factories was about equal in loudness to 
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machine noise, music prevailed and that, in the listeners’ perception, industrial 
noise seemed to adapt to it rhythmically. Although industrial noise may be pre- 
dominantly rhythmical and not cacophonous, music had the great advantage of 
being “designed” and therefore stood out. In Bergius’s view, however, it would be a 
cultural scandal if industrial noise were to be suppressed by music. After all, music 
was a product of culture, preferably of Germán culture. Although “designed” 
music prevailed over industrial noise, this did not mean that industrial noise had 
no harmful effects on the human organism (Bergius 1939, 112). 

From 1937 to 1939 music from recordings and broadcasts was played for 
a few hours every day in Germán faetones, including one in Berlin that employed 
130 tailors (108 women and 22 men), most of whom were working on sewing 
machines to produce military uniforms. The Nazi “Strength through Joy” organi- 
zation and the “Germán Labor Front” supported music at work primarily because 
it fostered a sense of community rather than actually boosting output. The bonds 
within the factory community, which formed part of the larger “Germán peoples’ 
community,” were to be strengthened by this practice. Obviously, the “factory 
leader” and the government also welcomed any increases in productivity. Although 
the exact number of loudspeakers in the factory is not given in Bergius’s study, it 
must have involved a considerable number in order to mask the din of the numer- 
ous sewing machines. Contrary to the practice in Britain, music would accompany 
the start of the morning shift for half an hour and was again played at the end. 
There seems to have been no research on the most appropriate time for playing 
music; this particular sequence was chosen because the “factory leader” believed 
that employees needed special encouragement in the morning and also when the 
shift was about to end (Bergius 1939, 130-39). 

As in Britain, light music, dance music, and popular songs were played, but, in 
Germany, marching music, reflecting the popularity of the military, was also in 
demand. Unlike in British factories, songs were favored, and employees were allowed 
and even encouraged to sing together during work so as to boost group morale. It 
seems, then, that in this case Nazi ideáis were not only preached programmatically 
but also put into practice, possibly at the cost of increased productivity. 

Bergius also studied the acceptance of music in factories and found out that, as 
in Britain, most employees welcomed music on the shop floor. However, in his 
empirical research, some of his questions were different from those of the research- 
ers in Britain. This had to do with the particular interests of Germán academic 
psychology at the time. Apart from his interest in the allegedly different behavior 
of northern and Southern Europeans, Bergius also made a distinction between dif- 
ferent body types, following the Germán psychologist Ernst Kretschmer: pyknian, 
leptosome, and athletic. He claimed that the leptosome types were (positively or 
negatively) affected by music the most, more so than the pyknians, whereas the few 
athletic types among the employees, who exhibited a “tough temperament,” did 
not seem to care very much whether there was music on the shop floor. Women 
were, according to Bergius, generally more affected by music than men (Bergius 
1939> 138). 
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To realize the goal of increasing labor productivity and masking industrial 
noise, the music played in faetones should be just marginally louder than the 
machines. If it was not loud enough, workers had to concéntrate too much on lis- 
tening to it, which distracted them from work. Workers generally welcomed the 
music program, but one female worker was opposed to it. Why was this? Bergius 
first assumed that this woman exhibited a stubborn, oppositional attitude toward 
the community spirit and probably needed some refresher course in Nazi ideology. 
Simply asking her for her reason, however, yielded a much more favorable result, 
which doubtlessly pleased Bergius, a man of culture: The woman proved to be a 
great music lover and often listened to music at home, concentrating solely on the 
notes. She felt that listening to music in a factory did not do justice to music as a 
lofty product of culture (Bergius 1939, 139). 

“Music while You Work” also played a role in the Germán Democratic Republic 
(GDR), although, according to Soviet and GDR authors, its function was different 
from music in capitalist countries. In the latter the main aim was to increase output, 
whereas in the socialist States the well-being of the workers carne first. In the Soviet 
Union several industrial companies started introducing music in their factories, but 
their number remained small. As several authors explain, the music-selection pro- 
cedure and loudspeaker installations in factories were amateurish, which under- 
mined the benefits (Klughard and Schwabe 1975). In the early 1970S the USSR Labor 
Research Institute in Moscow carried out a research project on the music most 
appropriate to enhance job satisfaction among Soviet workers. The factories that 
adopted industrial music dealt mainly with assembly work and simple, monoto- 
nous labor processes. Research results showed that light music and folk music were 
particularly well received by the workers (Klughard and Schwabe 1975, 610-12). 

Based on these findings, GDR researchers conducted a study in a film factory in 
Wolfen (Saxony Anhalt) to find out what kind of music was most appropriate for 
enhancing workers’ job satisfaction. Their hypothesis, taken from the results of 
investigations in the Soviet Union, was that music on the job increases the well- 
being of workers, reduces fatigue, and enhances the ability to concéntrate and 
therefore the quality of the product. Workers should not need to concéntrate much 
on the music, implying that instrumental pieces were to be preferred over vocal 
ones. Even though the latter sometimes distracted from work, they were particu- 
larly popular, especially when sung in Germán. The music should be rhythmic and 
have little variation in tempo (Klughard and Schwabe 1975, 617). Since we have little 
information on the extent to which these findings were applied in other 
companies, it seems that industrial music did not play a large role in the GDR. 

In the Federal Republic of Germany (FRG), music at work was much more 
widespread. In 1966 it was estimated that fifteen thousand factories used music pro- 
grams on the shop floor or allowed workers to listen to their own radio sets (Meissner 
1976). One of the organizations that supported the aim of increasing productivity 
through music was the Germán industry’s “rationalization committee.” From 1960 
onward, it sponsored a research project on the effeets of industrial music, in which 
nine companies from several branches — textiles, machine tools, printing machines, 
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food processing, precisión engineering — participated. This resulted in a well-docu- 
mented dissertation on industrial medicine by Günter Last from the University of 
the Saarland in Saarbruecken, published in 1966. Last’s main aim was to test the 
hypothesis put forward by supporters of industrial music, notably in Britain and 
the United States, who claimed that music at work increased job satisfaction and 
labor productivity significantly (Last 1966). 

The most general finding of Last’s investigation was that, as in most other com- 
panies in which these experiments were carried out, job satisfaction did in fact 
increase, albeit to different degrees. As a whole, industrial music enhanced the 
working atmosphere, reduced unnecessary conversations on the shop floor, coun- 
tered monotony, and reduced fatigue (Last 1966, 89). In fact, in a factory producing 
machine tools and printing machines, several workers were prepared to work over- 
time only if industrial music were provided. Although musical preferences varied, 
there was, as in Great Britain and the GDR, a penchant for pop songs and dance 
music. Songs that workers could hum or even sing along with were particularly 
popular, if not with their employers, who worried about distracted employees, 
lower output, and defects caused by lack of concentraron. 

A problem aróse in factories in which employees were exposed to noise levels 
that differed significantly in the various areas. Introducing industrial music only in 
areas with a comparatively low noise level caused resentment in those employees 
who worked in noisier surroundings, so music was broadcast either in the factory 
as a whole or not at all. Last found out that after having listened to music at work 
for a long time, workers got used to it, and its activating effect disappeared. 

Contrary to the outcome of earlier research on the effects of industrial music in 
Britain and the United States, Last carne to the conclusión that industrial music did 
not have a significant effect on labor productivity, absenteeism, job fluctuation, or 
defect frequency. If there were any such effects, they were but small and statistically 
hardly relevant. Slight increases in labor productivity in the presence of industrial 
music could be due to less conversation between workers on the shop floor or to 
their not leaving their workplace because of the monotony of work. Last felt it to be 
impossible to make more detailed claims about the effect of music on labor output 
because numerous other factors were involved. Therefore, the role of industrial 
music could not be conclusively determined (Last 1966, 79). 

It is no surprise that there were differences regarding concept and application 
of industrial music in Britain, Nazi Germany, the GDR, and the FRG. In Nazi 
Germany and the GDR the ideology of enhancing the workers’ well-being by play- 
ing industrial music was to the forefront, but in practice industrial music on the 
shop floor did not play a large role, most probably because the productivity effects 
were considered small. In capitalist Great Britain and the FRG, not so much the 
well-being of labor but job satisfaction, a notion closely related to the former but 
different from it, was highlighted. The champions of “Music while You Work” in 
Britain claimed a notable increase in labor productivity through music, which was 
also due to the masking of noise, whereas Last’s research on the FRG in the early 
1960S established a negligible productivity effect of music. 
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Hearing Protection Devices 

AT THE ShOP FlOOR 


As early as 1907 the “Ohropax” earplug was introduced, but workers were generally 
not very eager to use it or similar devices. Many of them regarded the use of ear- 
plugs as “unmanly” and were not worried about noise-induced hearing loss, which 
carne only slowly and gradually. On a symbolic level, noise in industry also had 
positive connotations. As Mark Smith also shows in his contribution to this volume, 
a noisy factory meant that industry was booming and earnings were good. Besides, 
wearing a hearing-protection device made workers insecure about where the noise 
carne from while also causing communication problems. Listening to machine 
noise (or sound — because noise/sound was often not perceived as unpleasant) gave 
workers a feeling of security; they would hear it when something was wrong. 
Furthermore, wearing ear protection could even be dangerous because one could 
also fail to hear particular warning signáis. Only from the late 1950S onward did 
protective devices like earplugs and earmuffs grow more common. Employers and 
liability- insur anee associations insisted on their use mainly because they faced rap- 
idly increasing compensation claims for employees’ hearing losses (Bijsterveld 2008, 
69-76). 

One of the reasons this took so long was that it was difficult to establish how 
occupational noise was linked to hearing loss. Employers and employees often 
argued that a partial loss of hearing was not critical to performance on the job. Until 
legislation set noise-exposure limits in the late 1960 and 1970S in many European 
countries, it was largely left to the industrial workers themselves to use hearing 
protection. Physicians’ advice failed to impress them because, as hinted at earlier, 
such caution clashed with their cultural valúes and the symbolism of noise, with a 
shop-floor culture of listening to machines and with the overall positive meaning of 
loud sound, as well as for their routines on the shop floor (Bijsterveld 2008, 

77-81). 

A 2005 study assesses the influence of hearing-protection devices (HPs) on the 
recognition of warning-horn signáis during work on railway tracks (Lazarus 2005). 2 
The outeome was that, with normal hearing, a frequeney-independent hearing pro- 
tector, a plug, improved hearing performance, whereas a frequeney-dependent HP, 
a muff, tended to worsen it, particularly with low-frequency noise. Regarding 
employees with a hearing impairment, hearing performance diminished when 
plugs were worn. For subjeets with normal hearing, low-frequency interference 
noise led to a major deterioration in hearing performance when earmuffs were 
worn, whereas hearing performance with high-frequeney interference noise 
remained almost unchanged. If the subjeets, whose average hearing loss measured 
at frequencies of 0.5/1/2 kHz did not exceed 20 dB, wore earplugs, the warning- 
horn signáis could be heard equally well or even slightly better. However, if 
hearing loss was greater than 20 dB and the subjeets wore earplugs, the higher their 
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hearing loss, the more their perception of warning-horn signáis deteriorated. From 
this it follows that when selecting a hearing protector, the frequency distribution of 
noise, signal attenuation, and hearing loss all have to be taken into account. In 
normal cases, in which hearing protectors cannot be adapted to the spectriim of 
signáis and noise, hearing-protection devices that offer frequency-independent 
attenuation (plugs) should be used, but only if the subjects do not suffer severe 
hearing loss. 


Protecting Comrades: Industrial 
Noise Abatement in the GDR 


When considering noise and noise abatement in the Germán Democratic Republic, 
the first socialist State on Germán soil, the question arises whether the GDR cared 
more about this problem than nonsocialist States. In theory this should have been 
the case because, after all, a prime aim of the socialist revolution was to ensure the 
well-being of the working class. Indeed, apart from outperforming capitalist coun- 
tries by means of a more competitive industry, the government of the GDR consid- 
ered it a humanistic obligation to improve labor conditions. 

Attempts at noise abatement in industry were led not only by governmental 
bodies but also by institutions such as the “Chamber of Technology,” the equiva- 
lent of the Federal Germán Society of Engineers, which were very active. The 
Dresden institute in charge of improving labor conditions spent much time and 
effort carrying out relevant research. The “Protection against Noise” bureau offered 
counseling and made recommendations, worked out standards and guidelines, and 
coordinated R&D on the acoustics of machinery. In line with international efforts 
to curb industrial noise at the source (primary noise-prevention measures), the 
foremost goal was to replace oíd machinery with new, designed to reduce noise on 
the shop floor (Scheuren 2002; Parthey 1989). 3 

At the company level, noise-abatement activities in the GDR were part of the 
organization in charge of occupational safety and protection against fire. 4 In a com- 
pany, the foreperson or department manager served as leader of a collective, and 
that person had to make sure that employees heeded the rules and regulations 
regarding protection against noise. The company director, to whom the foreperson 
was accountable, had a supervising and controlling function. The crew leader had 
to acquire the relevant knowledge of occupational hygiene and had to attend courses 
on that topic to obtain a certifícate of qualification, which had to be renewed every 
two years. 

Moreover, the companies established inspections of workers’ safety. The safety 
inspector made sure that the foreperson fulfilled all work-safety duties and reported 
to the company director. The foreperson was also authorized to issue directives on 
occupational safety. The safety inspector cooperated with the company’s health 
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Services: In small companies with the company doctor, in medium-sized compa- 
nies with an infirmary on the premises, and in large companies with the company’s 
hospital. If deemed necessary, the safety inspector ordered noise measurements or 
auditory checks. 

The GDR bureaucracy concerning industrial hygiene had even more to offer: 
Another organization dealing with the issue of occupational noise abatement was 
the trade unión. In the GDR there was only one trade unión, the “Free Germán 
Labor Union,” a unified labor unión. Virtually all employees, including forepersons 
and directors, were members of this organization. At the company level, the unión 
appointed a representative in charge of occupational safety. On a more general level 
the Free Germán Labor Union appointed occupational safety inspectors who coop- 
erated with their counterparts who were employed by the company but also checked 
them and gave them instructions. They tried to ensure realization of the labor 
unions’ objectives, which in theory — but not always in practice — were identical to 
the company’s goals. The company and the trade unión inspectors had to make 
sure that the employees knew the relevant regulations, were alerted to new ones, 
understood them, and received answers to their questions. In case of an accident 
the first questions normally were these: Who was responsible? Who had to be 
alerted? Had the workers involved in the accident heeded the work safety regula- 
tions, which were regulations of a legal character that governmental bodies had 
worked out in cooperation with the trade unión? In addition, the technical stan- 
dards had legal status as well. All of this shows that, organizationally and institu- 
tionally, there was, at least officially, a tight net of checks and balances in the GDR 
noise-abatement bureaucracy. 

But how well did it work? As we know from GDR economic and political his- 
tory, being tied to five-year plans and a stifling bureaucracy did not make for indus- 
trial efficiency (Bahr and Petzina 1996; Bentley 1984). At a conference in 1987 
celebrating the thirtieth anniversary of the GDR’s Noise Abatement Commission, 
one of the main speakers, praising the achievements of noise abatement in the GDR, 
gave productivity increase a more prominent place than the welfare of the workers. 
Because the GDR desperately needed currency, machine noise, he argued, had to be 
reduced not least because of exports partly to oil-producing countries such as the 
United Arab Emirates. These export countries often had ambitious standards 
regarding machine noise, and, of course, the GDR had to compete with capitalist 
countries, which had better financial resources to meet those standards. A low noise 
level emitted by machinery, then, was a vital factor in selling machines to other 
countries . 5 Nevertheless, some of the other socialist countries caused problems. 
Most of the GDR exports went to the COMECON (Council for Mutual Economic 
Assistance) countries. In the late 1980S the Soviet Union had a standard of 80 dB (A) 
máximum for new machinery, whereas the GDR worked with a máximum of 85 
decibels . 6 Reducing the noise level of the GDR to 80 decibels would have increased 
the financial difficulties of the GDR firms even more. 

However, apart from the obligatory praise of the GDR concept and practice 
of noise abatement, there was criticism of various sorts. For example, machinery 
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producers gave out unreliable information on noise emitted. If a company was 
“well connected,” it was possible to obtain permits to manufacture machinery that 
did not meet the noise-emission requirements. Sometimes factories also had a 
shortage of noise-abatement specialists. 7 

Although at the anniversary meeting in 1987 participants leveled criticism at 
how GDR authorities dealt with the problem of machine noise, the reality on the 
shop floor was generally more severe. After attempts at reform in the 1960S to make 
the five-year plan more flexible and give material incentives to innovators, business 
as usual prevailed. The recurring problem was lack of funds. Keeping up full 
employment in the GDR and paying for the social security system, the army, and 
the huge bureaucracy were expensive indeed (Braun 1996; Wienhold 2006, 2008; 
Jarausch 2008). Conversely, productivity was low. Whereas in the late 1980S FRG 
researchers had assumed that productivity in the East Germán industry was about 
65-70 percent of that in West Germany, research after Germán reunification showed 
that GDR productivity in the 1980S had reached only about 35 percent of that in 
West Germany. Therefore, not many resources were available for designing new 
noise-reduced machinery. Whereas the FRG imported much technological knowl- 
edge not only from the United States but also from the many federal Germán mul- 
tinational corporations that were instrumental in technology transfer, little of that 
took place in the GDR (Schweres 1991). 

Although several firms in the GDR could compete internationally, and they 
were duly highlighted by the government, the full picture was much less favorable. 
For this reason there was hardly any development of new, noise-reduced machin- 
ery. The common picture in GDR factories was that machinery was seldom replaced, 
was repaired again and again, and was used until it fell apart. Funds were seldom 
available because they had to be used for importing energy and raw materials, a 
sector in which the GDR was rather weak. For the standards of COMECON coun- 
tries regarding machine noise, the producís the GDR had to offer were generally 
good enough. After reunification the enormous environmental problems that had 
accumulated during the GDR times also became visible (Schweres 1991). 

As early as 1978, Herbert Hartig, vice director of the Dresden Institute for 
the Protection of Labor, had reported on important issues of industrial noise and 
noise abatement in the GDR. He made it clear that protection of workers against 
industrial noise not only pertained to individual welfare but also had economic 
ramifications. Which of these aspects involved in industrial noise and noise 
abatement were decisive depended on the social system; in marked contrast to the 
capitalist system, in socialist countries the humanistic aspect always carne first 
(Hartig 1978). 

However, also in socialist countries like the GDR, the compensation paid to 
workers who suffered from hearing difficulties caused by industrial noise was sig- 
nificant. In addition, according to Hartig, workers with hearing impairment were 
less productive and, if they quit, had to be replaced by new workers, who also had 
to be trained. Moreover, “noise jobs” were not popular and could be filled only by 
offering higher wages (Hartig 1978, 333-34). Noisy machinery for the export market 
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caused losses in foreign trade because foreign customers did not accept them or 
were prepared to buy them only at reduced rates. Of course, the efforts by the 
Socialist Unity Party (SED) party, the government, and the health-protection offi- 
cers led to some improvement in the industrial noise situation in the GDR. As to 
primary measures of noise abatement, machines were designed that emitted less 
noise. 

However, the situation left much to be desired, and several goals to improve 
the situation had not been realized. In particular the claim that industrial progress 
and efficiency increases made for better working conditions had proved to be a 
myth. Productivity increases had led to a spatial concentration of machinery and 
to significantly higher speeds being used in drive belts. Research at the Dresden 
Institute showed that a doubling of revolutions per minute in machine drive 
caused a 12-dB increase in sound emission. The institute had also tried to work 
out a mathematical formula on the relationship between the reduction of sound 
emission and an increase of labor productivity. Because of the many different 
factors involved and the large differences in the jobs investigated, this proved 
impossible. However, as a rule of thumb, the institute stated that a reduction of 
industrial noise emission by 5-10 dB resulted in a 5-percent increase in individual 
job performance. This, however, applied only to relatively simple, repetitive 
work, whereas in work that demanded more concentration, such as that of crane 
operators, performance loss caused by industrial noise was significantly higher 
(Hartig 1978, 335). 

If it had not been possible to satisfactorily implement primary industrial noise- 
abatement measures through machine design or, for that matter, to apply second- 
ary measures such as encapsulation, what about the application of individual 
ear-protection devices like earplugs or mufflers? The legal and regulatory frame- 
work for this existed: In the GDR, industrial workers’ hearing difficulties were 
regarded as an occupational illness from 1958 onward, and a 1970 regulation made 
it obligatory for all factory managers to introduce ear-protection devices whenever 
a certain threshold valué was surpassed. In 1977 there were about 250,000 industrial 
“noise jobs” in the GDR (i.e., jobs where primary or secondary noise abatement 
measures had not been successful). Theoretically, workers could still have been 
protected at least to a significant degree by individual ear-protection devices. As it 
turns out, however, the industrial health organizations’ campaigns were far from 
successful. Investigations of some 90,000 industrial noise workers in the GDR 
showed that only 18.2 percent regularly wore individual ear-protection devices. 
Moreover, 16.5 percent did not wear them at all, and 65.3 percent wore them only 
intermittently or not according to official requirements. So only about 25 percent of 
industrial workers in noise jobs wore ear-protection devices properly (Hartig 1978, 
337-38). 

Aside from simple earplugs, more sophisticated devices were available, but they 
found only limited application. These were otoplastics that sealed the ear com- 
pletely. They put higher demands on their manufacture and had to be fitted by an 
otologist. Earplugs made from synthetic material and fixed on a support bracket 
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seemed to have several advantages, including that they sealed the ear canal tightly, 
were relatively easy to produce, and had to be worn only when necessary. However, 
they often exerted an unpleasant pressure on the ear, especially when the wearers 
moved their head. They were particularly useful in dealing with intermittent 
noises. 

Mufflers have the advantage that they cover the whole ear and offer a high level 
of sound protection. However, wearing them can be uncomfortable, especially 
when temperatures are relatively high and when they are worn in combination with 
a protective helmet. In wearing ear-protection devices one factor is particularly rel- 
evant: It takes some time to get used to them, mainly because many of these devices 
have different degrees of sound abatement at varying frequencies. Moreover, the 
warning function of ears is severely impaired by wearing an ear-protection 
device . 8 

The example of the Diesel Engine Works Rostock, which produced marine 
diesel engines, illustrates the general remarks in this chapter. This company was 
located in the Southern part of the city of Rostock, Mecklenburg, cióse to the Baltic 
Sea, and was established in the early 1950S. At that time the buildings and the 
machinery were State of the art, but the problem was that, due to shortage of funds, 
little money was leít for modernization and replacement. The factory produced 
marine diesel engines with 4,000-20,000 horsepower, and it repaired smaller diesel 
engines. The main features of the production process were mechanical production 
(chipping), metal working (cutting, welding), hot treatment (postweld heat treat- 
ment, founding of bearings), assembly, transport within the company, and test runs 
(Spychala 2006, 2008). 

Most of the production units had a médium noise level, which did not require 
any hearing-protection devices. Metal working was an exception, especially when, 
for example, pneumatic chisels had to be used or straightening work after welding 
became necessary. To reduce workers 5 exposure to noise, an adaptable noise- 
protection ceiling was installed above each workplace, and workers had to wear 
earplugs or earmuffs. 

The situation in Rostock, as described thus far, differed little from that in other 
machine-building companies. But one problem was particularly enervating: The 
test runs could be extremely loud, especially at frequencies that humans find diffi- 
cult to cope with. The company at Rostock utilized different kinds of test stands, 
including one for engines manufactured in the Soviet Union with a high noise level. 
After repair, those engines were installed in ships of the People’s Navy. Because 
noise emission at test runs was quite high, a new, compact building was erected 
with a special control room for the operators. Even outside the test area the noise 
level was high and affected not only the employees at the diesel engine works but 
also nearby residents. Particularly unpleasant were the high frequencies emitted as 
a result of the engines 5 high number of revolutions and, especially, of the exhaust 
gas turbochargers. There do not seem to have been any restrictions on these testing 
procedures; any complaints were probably countered with reference to national 
defense concerns. 
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The company at Rostock also had a test stand hall for medium-sized diesel 
engines (4,000 hp) to repair engines for civil purposes, as well as a newly developed 
test engine of 3,000 hp. This hall contained about ten test stands that, apart from 
the new engine, carne without noise protection such as encapsulation. In order to 
cope with the deafening noise, the personnel in charge of the new engine’s test 
stand built a noise-reduced room by themselves containing the most important 
measuring and control instruments. However, the other workers in this hall and in 
the two other areas were subject to the noise in varying degrees. 

In the large hall with four test stands for engines up to 20,000 hp only one was 
working at a time. These stands were located in the northern part of the three linked 
halls. The middle part was used for subassembly and some parts of mechanical 
production, while the Southern part of the building was used only for mechanical 
production. Two-thirds of the outer walls of this hall consisted of glass, whereas the 
interior of the hall had no dividing walls whatsoever. None of the designers at the 
time of its construction seemed to have thought about noise protection. This meant 
that noise emissions at test runs affected the whole hall, as well as employees work- 
ing in the halPs extensions. As in the other halls with test runs, it was not so much 
the low frequencies of the engine itself but the piercing high frequencies of the 
exhaust gas turbocharger that created the most problems. In the 1950S, when the 
hall was built, its design was regarded as particularly innovative and progressive, 
but later the hall seemed to many workers to sit there for eternity, unalterable and 
grand. The only way out was to avoid test runs in this hall altogether and to build a 
new one cióse to the water, which would also have had the advantage of easier 
transport. Plans to build this hall started in late GDR times. Lack of funds, however, 
prevented them from being pursued, and they were not taken up again until the 
early 1990S, when the Diesel Engine Works Rostock became part of the Vulcan 
Shipyard of Bremen. Vulcan built the new hall, but the company went bankrupt 
shortly afterward, and today the hall is owned by the Caterpillar Diesel Engine 
Company of Rostock. 

This brief case study shows that a large gap existed between the ambitious aims 
of industrial noise abatement in the GDR and actual shop-floor practice. This was 
not due to a lack of expertise in noise abatement but to lack of funds and to the 
government’s attitude that, at least in the Rostock case, industrial noise abatement 
played a secondary role to other goals such as maintaining production. In the 
Rostock case the future of the factory was severely constrained by the way it 
had been constructed in the early 1950S. Until the late 1980S, it seemed impossible to 
make substantial changes in matters of noise abatement. Significant noise- 
reduction designs rarely occurred because they would have required substantial 
R&D funds; as a result, little was done even regarding measures such as encapsula- 
tion of machinery. Consequently, employees resorted to noise-reducing devices 
like earplugs and earmuffs, which, even when used regularly and properly, could 
only partly alleviate the problem. 

In the Rostock case, employees took noise regulations seriously. With regard to 
test-run installations they had no other choice because the piercing noise emitted 
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by exhaust gas turbochargers was unbearable. However, also in areas with less noise, 
ear-protection devices were generally worn: If caught without them, employees 
faced severe reprimands from the foreperson and were publicly criticized at the 
next session dealing with work hygiene instructions. 


What about the “Working-Class 
Exploiters”? Industrial Noise 
Abatement in the FRG 


Having been somewhat harsh about the practice of industrial noise abatement in 
the Germán Democratic Republic, the question arises as to whether things were 
much better in its capitalist counterpart, the Federal Republic of Germany. In 
theory, and arguing from a GDR socialist point of view, this could not have been 
the case because in capitalist countries workers are exploited, and profit is all that 
counts. However, if entrepreneurs were to profit from noise-abatement measures, 
they would, according to this reasoning, surely introduce efficient noise-abatement 
devices in their faetones. What about the State, governmental institutions, and 
others bodies? According to Marxist-Leninist lore, they would do what big business 
expected from them. 

In the 1950S, industrial noise abatement in the FRG in fact did not play a large 
role. In this time of “reconstruction” similar to the immediate post-World War 
One period, other matters took priority. However, along with rising material wel- 
fare and a growing “environmental consciousness” from the early 1960S onward, 
topics such as noise and noise abatement began receiving more attention. In the 
FRG the Association of Germán Engineers again became active, and in 1960 it 
founded the Technical Noise Abatement Commission, which developed guidelines 
for acceptable noise levels on the shop floor and in public spaces. In 1968 the 
Technical Directive regarding Noise was established; it stipulated that, as far as 
noise emission was concerned, machinery used in factories had to be State of the art. 
This referred to new machinery, whereas in the case of existing plants secondary 
measures of noise abatement had to be taken. In 1974 the federal government 
enacted the Federal Emission Law, which articulated general requirements within 
which the different Germán States and factories had some freedom to act (Koch 
2002, 235-36; Liedtke 2005; Scheuren 2002). 

However, it was not so much an amorphous rise of “environmental conscious- 
ness” but hard faets, or, put simply, money that prompted industrial plants to do 
something about noise. As hinted at earlier, in Germany in the late 1920S hearing 
impairment and hearing loss were recognized as occupational diseases in the metal 
industries. This was extended in 1961 to other branches of industry. It meant that 
the employer’s liability-insurance associations and pensión offices had to pay 
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for the problems these disabilities created, while at the same time the number of 
workers taking early retirement rose quickly. This is why, from the early 1960S 
onward, primary and secondary measures to curb industrial noise increased. The 
Germán engineering industry, a large exporter in the world market, had sufficient 
R&D funds to develop new noise-reduced machinery because this was also, as 
already mentioned in the case of the GDR, a hallmark of quality (Scheuren 2002, 
208-10). 

But what happened on the shop floor? This is much more difficult to assess 
than delineating the debate on setting standards, on legislation, or on the acousti- 
cians’ differences in measuring noise. Research on this issue yields an uneven pie- 
ture. Not surprisingly, the noise-abatement activities in Germán industrial firms 
differed considerably depending on the industrial branch, on firm size, and on 
managers and owners. Still, one thing is clear: In terms of noise abatement, the 
West Germán industry did not simply outperform the GDR. Thyssen in Duisburg, 
a large metal-working company that seems a representative example to investígate 
for our purposes took until 1976 to initiate a large campaign on noise abatement on 
the shop floor. Its main purpose was to convince workers to wear noise-protection 
devices. Its outeome was a notable improvement, without creating an optimal 
situation: Before the campaign only about 20 percent of the workers had used 
ear-protection devices; afterward, this figure had risen to about 60 percent. 9 

Let us look more closely at this campaign and the workers’ reactions to it: In 
late 1976 the Employer’s Liability Insurance Association of the federal Germán met- 
allurgy industry started a campaign to fight occupational disease number 1, occupa- 
tional hearing loss. It distributed posters in plants, organized talks, and took other 
measures to increase employees’ awareness of this problem. “Not to wear hearing- 
protection devices shows a primitive attitude; people who enjoy living in noisy 
environments, who cause noise themselves, and who regard noise as proof of their 
industriousness should change their attitude as soon as possible. Protection against 
noise is protection of the self.” 10 The shop council and the industrial relations direc- 
tor of the Thyssen Steel company in Duisburg pointed out that noise-measurement 
activities had been conducted at the company since 1969. Also, the insurance asso- 
ciation’s 1974 regulation concerning occupational noise, which had legal status, 
determined that an ear-protection device had to be worn in noisy areas in the com- 
pany. Employees not complying with this committed a regulatory offense and were 
hable to a fine of up to 20,000 Deutsche Mark, a substantial sum. In cases in which 
employees’ hearing had already been damaged because of occupational noise, pen- 
sión payments could partly or completely be stopped if employees did not wear 
ear-protection devices. 11 

The campaign undertaken by the employers’ liability insurance association 
tried hard to counter employees’ reasons not to wear ear-protection devices. 
Although they might be uncomfortable to wear at the beginning, the insurance 
association argued, it would only be a matter of time before employees would 
become accustomed to them. In this the devices were similar to a pair of safety 
glasses that wearers also had to get used to. Moreover, why, the employers’ liability 
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insurance association asked, did people not create a similar fuss when required to 
wear protective glasses? With ear-protection devices, it seemed, the issue of manli- 
ness was at stake. The employers also dealt with the argument that, wearing ear 
protection, employees were no longer able to distinguish between different sounds. 
This, according to them, was not a valid argument. After all, machine sounds did 
not vanish completely; they were only abated, and their sound was changed. Getting 
used to detecting them merely involved a learning process. After a short time, 
employees would learn to recognize the now fainter, altered sound; it was similar to 
the situation an apprentice faced in an industrial company: The trainee, too, had to 
learn to make sense of the machine sounds. Moreover, according to the insurance 
association, the argument that speech communication was not possible when wear- 
ing earplugs was unsound. Physical laws showed, the association argued, that only 
high-frequency sounds were abated but not sounds relevant to speech communica- 
tion. Still, it was necessary of course to speak loudly and distinctly in order to make 
oneself understood in a noisy environment. 12 

Although these arguments sounded plausible, they, being arguments voiced 
by one side, the employers, were somewhat simplified and could not completely 
offset the reservations of many in the workforce. This applied even more to the 
complaint many employees voiced about inflammations of the auditory channel. 
This happened quite frequently and prompted employees not to wear earplugs. 
However, according to the employers 5 argument, these inflammations were caused 
by badly fitting earplugs or, even more frequently, by earplugs that had not been 
kept properly clean. In cases of auditory-channel eczemas or chronic middle-ear 
diseases, wearing earplugs was in fact forbidden. In these cases an otoplastic device 
was recommended. 

The campaign at Thyssen did have some effect. As stated earlier, workers 5 
acceptance of ear-protection devices significantly improved, but the ambitious aim 
to convince every worker to wear earplugs was not reached. As indicated, before the 
campaign only about 20 percent of the workers had used ear-protection devices, 
while after the campaign this figure had risen to about 60 percent. 


CONCLUSION 


This chapter has pursued only a few aspects of a large subject. Comparison of 
noise and noise abatement in industry during the Third Reich, the FRG, and the 
GDR has revealed that only in the FRG was industrial noise abatement moderately 
successful and that this carne about only from the 1970S onward. The Germán engi- 
neering industry, a large exporter in the world market, had sufficient R&D funds 
available to develop noise-reduced machinery. Noise reduction was also regarded 
as a hallmark of quality. On the shop floor the implementation of noise-reducing 
measures improved mainly because of stricter insurance association regulations. 
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For insurance organizations the rapidly increasing numbers of insurance claims 
had made it necessary to strengthen their efforts in reducing occupational hearing 
loss. Undertaking primary efforts to reduce machine noise through engineering 
design was a matter of funds. During the Third Reich the incentive to do so 
was low because other priorities prevailed. In the GDR the relevant know-how 
was certainly there, but scarce funds and the fact that only a small portion of 
exports went to capitalist countries did not lead to any serious effort to reduce the 
problem. 

Were the employees responsible for the fact that not more was done earlier? As 
my argument demonstrates, the reticence of workers vis-á-vis hearing-protection 
devices made sense, at least to a considerable extent. Their attitude can be accounted 
for with reference to legitímate safety concerns, while notions of “manliness” also 
played a role. Moreover, music on the shop floor served to mask industrial noise 
and, as some believed, increased productivity. 

Much more research is needed to enhance our understanding of industrial 
noise and noise abatement in the twentieth century. A glance at the annual bibliog- 
raphy of the Journal of the American Society of Acoustical Engineers makes it clear 
that in recent decades a great deal of research has been carried out on various 
aspects of this topic. This work may provide the basis of an interesting comparative 
study of industrial noise and noise abatement measures in industrialized countries 
in various parts of the world. Topics such as the history of sound-measurement 
devices and standardization in an international historical perspective deserve 
thorough analysis. Moreover, there is hardly any research on what actually took 
place on the shop floor. Race, class, and gender are crucial categories in today’s 
historical scholarship, but we have little knowledge about them in the context 
of industrial noise and noise abatement. In other words, much work remains to 
be done. 


NOTES 


1 Karin Bijsterveld (2008) has “Turning a Deaf Ear to Industrial Hearing Loss” as one 
of the subheadings in her chapter on industrial noise and noise abatement. 

2 I am grateful to Professor Hans Lazarus for several hints regarding industrial noise 
and noise abatement (email, Sept. 4, 2009; telephone conversation, Aug. 27, 2009). 

3 See also Kammer der Technik: Bezirksverband Gera und AG (Z) Lármschutz, 7. 
Konferenz Lármschutz: Fortschritte der Larmbekampfung — 30 Jahre Lármschutz . 2 vols., 
Gera, Germany: Kammer der Technik. Kampf dem Lárm, 1987. Also see Unsere ATH 
22(11/12) (1976). 

4 For the following information and for information on conditions in the Diesel Engine 
Works Rostock, with which I deal later, I am very grateful to Professor Franz Spychala of 
Rostock. He was employed at the Diesel Engine Works Rostock from 1960 until 1990 in 
increasingly responsible positions: engineering designer (1960-1961), development 
engineer (1968-1970), head of the development department (1970-1977), and head of 
development-controlling shipbuilding (1978-1990) (email, Aug. 26 and 28, 2009). 
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5 See Kammer der Technik, vol. 2, 83. 

6 See ibid., vol. 1, 14. 

7 See ibid., vol. 2, 15. 

8 “Lármbekámpfung mehr denn je aktuell.” Sozialversicherung/Arbeitsschutz 7 (1977): 16. 

9 I wish to thank Professor Manfred Rasch and Andreas Zilt, MA, Thyssen Archives 
Duisburg, for useful hints on the Thyssen case. 

10 “Kampf dem Lárm.” Unsere ATH 22(11/12) (1976): 3. 

11 Ibid., note 10: 17. 

12 Ibid. 
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CHAPTER 3 


“SOBBING, 
WHINING, 
RUMBLING”: 
LISTENING TO 
AUTOMOBILES AS 
SOCIAL PRACTICE 


STEFAN KREBS 


Introduction 


On a hot summer day in 1950, a Bavarian commercial traveler decides to bring his 
car to a nearby repair shop. When driving, he occasionally hears an inexplicable 
slapping noise, and he worries that this might be an audible indication of a techni- 
cal problem. The mechanics at the garage search for the source of the noise. In 
particular they check the carburetor and the ignition, which often cause trouble, 
but they are unable to find the slightest malfunction. When the motorist returns 
that same afternoon to pick up his car, the mechanics ask him to describe the slap- 
ping sound in detail, as well as when and where it is most noticeable. He explains: 
“Every day I drive down a long, straight Street lined with poplars, and there I always 
hear the slapping noise” (Anonymous 1950b, 353). 1 The mechanics burst out laugh- 
ing because they immediately understood that the driver was merely hearing the 
regular echo of the car as the sound was being reflected by the trees. 
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This anecdote, together with three similar stories, appears in the August 1950 
issue of the Germán trade journal Krafthand — a major periodical for car mechanics 
and dealers. All four accounts emphasize the superior knowledge car mechanics 
possess. If the opening anecdote suggests that even experienced drivers, such as 
commercial drivers, lack the necessary techniques to diagnose audible malfunctions, 
it also underscores the general importance of listening as a means of noticing 
technical problems and the general difficulties of locating a sound source and 
attributing meaning to it. 

In sharp contrast to the “tree echo” episode from the 1950S, Germán motorists 
in the 1920S in fact appeared to possess the necessary listening skills to diagnose 
malfunctions. As I show in this chapter, they even questioned the sonic expertise of 
their car mechanics. So when did listening first become an exclusive domain of 
Germán car mechanics? Furthermore, why did drivers of the 1920S and their coun- 
terparts of the 1950S listen to their cars differently? To tackle these questions, this 
chapter describes the relevant listening practices and explains how and why they 
differ. I focus on what Kevin Borg calis technology’s middle ground, the “ambigu- 
ous space between production and consumption” (Borg 2007, 2-3) and explore the 
complex relationship between Germán auto mechanics and motorists during the 
interwar period and the first years after World War Two. 

My first hypothesis is that the two actor groups (motorists and auto mechan- 
ics) gradually developed two different “modes” of listening: 2 “listening while 
driving” and “diagnostic listening.” To investigate the genealogy of these modes 
I conceptualize them as social practices in accordance with Pierre Bourdieu’s theory 
of practice. As a technique du corps , or technique of the body, listening is a practical 
sense (Bourdieu 1990, 66-79). Marcel Mauss (1936) describes techniques of the 
body as part of the habitus; they vary with educational method, decency, fashion, 
and prestige. Following Mauss, Bourdieu conceptualizes techniques of the body as 
“the socially informed body , with its tastes and distastes, its compulsions and repul- 
sions, with, in a word, all its senses , that is to say, not only the traditional five senses 
. . . but also the sense of necessity and the sense of duty, the sense of direction and 
the sense of reality” (Bourdieu 1977, 124). 3 For Bourdieu, agents incorpórate into 
their habitus the social logic that constitutes the field. He posits a homologous rela- 
tion between habitus and field (Bourdieu 1999, 138-46). Accordingly, I assume that 
listening to automobiles as a social practice is structured by the habitus of the actors 
and the field they act in. 

In addition to Bourdieu’s theory of practice, I suggest using the notion of the dis- 
positive to further investigate the genealogy of the listening practices. According to 
Michel Foucault, the dispositive is a heterogeneous ensemble “consisting of discourses, 
institutions, architectural forms, regulatory decisions, laws, administrative measures, 
scientific statements, philosophical, moral and philanthropic propositions — in short, 
the said as much as the unsaid” (Foucault 1978, 119-20). Within the dispositive 
discourses and institutionalized power structures affect nondiscursive practices 
(Jáger 2001). In my case study, I assume that the nondiscursive practice of listening to 
automobiles is shaped by the power structure between mechanics and automobilists. 
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This brings me to my second hypothesis: The formation of a particular Germán 
auto mechanic’s dispositive in the 1930S altered the power structure in the field, 
giving rise to the differentiation of the two listening practices by excluding motor- 
ists from the realm of <c diagnostic listening.” In the first two sections I explore the 
listening techniques of motorists and mechanics in the 1920S. The next section 
describes the repair crisis at the end of that decade and the new legislation concern- 
ing the auto mechanics’ trade in 1934. Subsequently, I focus on the automotive 
technology of that time. The final section discusses the differentiation of the two 
listening practices. I have analyzed different trade journals for automotive engi- 
neers, auto mechanics, and garage owners, as well as special-interest journals for 
automobilists. Particularly the periodicals of automobile clubs turned out to be rich 
historical sources as they contained hundreds of letters from car drivers, who 
described technical problems — often mentioning their listening practices. In addi- 
tion, I considered contemporary handbooks for car owners, chauffeurs, and auto 
mechanics. 


Listening to Automobiles: 
The Driver’s Experience 


My inquiry starts at the end of World War One, during which the prívate use of 
automobiles had virtually ground to a halt (Ruppel 1927, 10). At that time, members 
of the Germán automobile lobby, such as the Imperial Automobile Club ( Kaiserlicher 
Automobil Club), anticipated a growing demand for prívate motorization in peace- 
time. They argued that the military use of cars and trucks had proven the usefulness 
and reliability of automobiles as a means of mass transportation (Kónig 1919). The 
postwar car was imagined to be small, light, economical, and effortlessly drivable 
without the help of a mechanic (Ledertheil 1919). 

Experts projected that the self-driving tradesman and other middle-class people 
would be the most likely future car owners. Rudolf Hessler, author of Der Selbstfahrer 
(The Self-Driver) (Hessler 1926, 7), presupposed a reasonable level of technological 
knowledge as common among these potential motorists. Handbooks and driving 
manuals were intellectually demanding, and the división between professional and 
popular automobile journals was not as sharp as today . 4 Accordingly, there was a 
sense that the new categories of car owners were going to replace the chauffeur in 
his two roles as driver and mechanic (Borg 2007, 13-30). This continuum between 
chauffeurs and postwar motorists is visible in the specific concepts used in the 
literature of the time. A great number of instruction books equally addressed pro- 
fessional chauffeurs and self-driving automobilists (Parzer-Mühlbacher 1926; 
Hacker 1932; Martini 1938). In concepts and technical level, even specialized alma- 
nacs for chauffeurs did not differ considerably from more general instruction books 
(Martini 1922). 
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It seems worthwhile, then, first to examine the chauffeurs’ practice in terms of 
their concerns with listening and automobile sound. Besides driving, their main 
duties included maintenance and repair work. Only major engine jobs that needed 
special equipment were to be left to repair shops (Martini 1922, 13). Chauffeurs 
needed a high level of technical knowledge and driving know-how (the latter was 
acquired exclusively through experience). In addition, chauffeurs had to have a 
good sense of hearing: “The ear should rigorously register the finest deviance of the 
engine sound” (Kónig 1919, 12). While driving, the chauffeur had to listen carefully 
to the machine (Küster 1907, 10), and if he detected any dissonance, he had to 
look for its source and decide whether it required immediate repair (Martini 
1922, 207). One could develop a fine sense of the engine’s rhythm and timing only 
through experience, but theoretical understanding of automotive technology 
counted as a prerequisite for proper diagnosis (Küster 1907, 10). In summary, pro- 
fessional drivers had to learn to listen to automobiles through hands-on experience, 
and they had to acquire the appropriate technological knowledge. This technique of 
listening and the embodied cultural capital were requisite for success as a good 
chauffeur. 

As already indicated, self-driving car owners would find similar advice in the 
contemporary sources of the day. In Ohne Chauffeur (Without Chauffeur), one of 
the earliest handbooks that appeared in as many as thirteen editions between 1904 
and 1930, the motorist was urged to “listen to the desires of his engine” (Schmal 
1912, 10). Other manuals stressed the importance of regularly listening to the sound 
of the engine (Hacker 1932, 62). Once the driver noticed any disco rd, he was to drive 
carefully and “open up his ears” (Hacker 1932, 83). Furthermore, drivers were 
instructed to avoid any unnecessary noise because only when the engine “runs as 
quietly as possible, any malfunctions that might arise can be noticed plainly and 
early” (Hessler 1926, 217). One even suspected that too much noise would harm the 
drivers’ sense of hearing, thereby affecting their sensitivity to the “desires” of the 
machine (Hessler 1926, 203). In general, all motorists could train their ears: 

With growing experience and habit even the beginner learns to focus his atten- 
tion on other things, especially his own car, without being distracted from the 
road. It is primarily the rhythmic and silent run of the engine that requires his 
attention. The regular humming of the gearbox or chain drive indicates that 
everything is in best order. He will soon notice that every engine and every car 
has its own pace and that even the slightest technical problem alters this lovely 
rhythm. He will involuntarily listen to this pace very closely, thereby avoiding any 
greater malfunctions. A knock or rattle of the engine, a crunch of the chain, a 
rattle of a bolt will indicate the spot where the car needs maintenance, and he will 
do well to follow the slightest hint to repair malfunctions in time before they 
grow worse. (Küster 1919, 304) 

As this quotation from the handbook Das Automobil und seine Behandlung 
(The automobile and its maintenance) suggests, motorists needed time to get to 
know their car and to develop the necessary listening skill to understand what the 
engine “said” to them. They had to learn to distinguish the familiar sounds of a 
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properly running vehicle from the deviant sounds that indicated problems, a prac- 
tice Karin Bijsterveld calis “monitory listening” (Bijsterveld 2008, 77-78; 2009). 

Such monitory listening techniques, which constitute the social practice I cali 
“listening while driving,” are just one side of the coin. As mentioned earlier, auto- 
mobilists were to undertake supplementary diagnoses of the malfunctions they 
noticed while driving. Here, too, listening played an essential role: <c [N]oise is most 
important for the detection of technical fault” (Hessler 1926, 216). In comparison to 
the technique of monitory listening, the practice of “diagnostic listening” was much 
harder to achieve: “This skill, to make the correct diagnosis out of a knocking sound, 
requires tremendous experience and exact knowledge of the type of engine con- 
struction” (Hacker 1932, 83). Maintenance almanacs for motorists provided help, 
however; they offered a systematic overview of possible malfunctions, symptoms, 
and ways of repairing. Oskar Hacker’s manual (1932), for example, categorized mal- 
functions by the senses: seeing, hearing, smelling, and feeling. He needed no less 
than twenty-nine pages to list a wide array of audible failures, a systematic approach 
that was also adopted in journal articles. Other manuals ordered the maintenance 
section by components or combined the explanation of the engine’s function with 
potential malfunctions (Schmal 1912; Küster 1919). Implicitly, handbook authors 
began codifying car sounds by transforming their auditory experience into com- 
municable signs and meanings, if only with limited success, as we will see later. Jens 
Lachmund has described similar problems that physicians encounter when listen- 
ing to a patient’s body (Lachmund 1999, 420). 

All of the authors emphasized the importance of technical knowledge and clear 
thinking: Motorists had to carry out a systematic inquiry to achieve their objective. 
As guidelines, the manuals contained tables, lists, and fault trees. They could be 
used to perform a differential diagnosis to narrow down the list to a single condi- 
tion and provide a basis for a hypothesis of what was ailing the “patient” (Hacker 
1932, 18). It is no coincidence that this resembles the physician’s routine: “As in the 
relation between a physician and his patient, the diagnosis itself is the most impor- 
tant thing to a driver” (Hessler 1926, 216). 

Further, the motorist was advised to ask specific questions about the different 
symptoms, the components that might be “infected,” or the specific driving condi- 
tions under which the malfunction was most noticeable — just like a physician 
taking a patient’s medical history. “If the physician cannot make his diagnosis by 
the appearance of the patient, he will take his stethoscope and listen to the patient’s 
body. This is how you ought to proceed with your car engine as well” (Hessler 1926, 
216). To listen at definite spots, the driver could use a screwdriver or a long metal 
pipe as a simple ear trumpet (Hacker 1932, 81; chapter 12). A driving manual of the 
Germán Association of Motorists (Allgemeiner Deutscher Automobil Club) stated 
that, with a stethoscope, a trained ear could indeed lócate a single dry-running 
bearing (Dietl 1931, 324). Corresponding hearing devices, such as the Auto-Doktor, 
were advertised in motoring journals (Anonymous 1929a). 

As mentioned earlier, handbook authors recognized automobilists as having 
the ability to make a range of significant distinctions between acoustic phenomena. 
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Figure 3.1 Cartoon of a car mechanic as a physician, from the Briefkasten section of the 

Allgemeine Automobil-Zeitung 33(42) (1932): 19 (Jonny). 


Furthermore, they were seen as eager to acquire the expertise to repair their cars 
themselves. Nonetheless, why should they want to acquire the needed knowledge 
and experience, including the hard-to-learn technique of diagnostic listening? The 
magazines and handbooks of the time agreed on two intersecting aspects: First, self- 
repairing was simply cheaper and, if done in time, useful in avoiding more serious 
technical problems. Second, car owners were obliged to monitor their cars to avoid 
any accidents that might occur due to technical problems. Even if they employed a 
chauffeur, they were still legally obliged to monitor their car’s condition, as well as 
the chauffeur’s driving habits. Thus, employers needed the appropriate technical 
knowledge and experience to supervise their chauffeurs, in particular if they doubted 
their employees’ trustworthiness (Anonymous 1926). 

From 1928 onward, the Allgemeine Automobil-Zeitung carried a section called 
“letter box” (Briefkasten), which regularly addressed legal and technical problems. 
Until the outbreak of the Second World War, this journal received some fifteen 
thousand letters, a selection of which was published in each issue. Some reveal con- 
cerns about motorists 5 listening practices. In general, car owners followed the advice 
given in the driving manuals; they carefully monitored the running of their 
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machines — they “listened while driving.” Their descriptions were detailed accounts 
of the specific driving conditions under which a suspicious noise occurred. To 
¡Ilústrate their cars’ audible idiosyncrasies, drivers used a wide range of adjectives: 
Their cars were sobbing, whining, rumbling, as well as stuttering, hammering, 
knocking, singing, howling, growling, ticking, hissing, and droning (Briefkasten 
1928c). The letter writers additionally referred to other common sounds to depict a 
specific noise, such as the chirps of a cricket (Briefkasten 1930). Other letters 
described the frequency and pitch of the noise (Briefkasten 1929a). In addition, they 
listed tests and repairs they had already carried out and claimed their own expertise 
by referring to their personal experience: “I have been a self-driver for twenty-two 
years now, and I know a lot about engine designs, but this time I am helpless,” one 
driver grumbled (Briefkasten i928d). 

Often the editors had difficulty making sense of the written accounts because 
motorists, despite the attempts to codify car sounds in handbooks and journals, 
shared no standardized vocabulary to describe their auditory experiences. A knock- 
ing, for example, could indicate different malfunctions: spontaneous ignitions as 
well as worn-out pistón bearings. In difficult cases the editors gave general sugges- 
tions or described diagnostic strategies to narrow down the range of possible faults 
(Briefkasten 1929b). Sometimes the communication failed completely: A “hot 
noise,” as one reader wrote, made no sense (Briefkasten 1938). To stress their own 
expertise, the editors stated that experienced mechanics needed more than ten years 
to develop a “trained ear,” allowing them to trace any noise to its exact source 
(Briefkasten 1933). Despite this emphasis, the Briefkasten advisers always treated 
the readers as technically competent and believed them capable of major repairs 
such as cleaning the oil-carbon deposit in the combustión chamber (Briefkasten 
1928a). 

To summarize, motorists in the 1920S developed two intertwined listening 
practices. First, they monitored their cars and “listened while driving” to detect 
technical problems in time. Second, they acquired the technique of diagnostic lis- 
tening. A 1926 manual suggested that the experienced driver and the efficient chauf- 
feur could both be recognized by their diagnostic abilities (Parzer-Mühlbacher 
1926, 333). Clearly, motorists were oriented toward the chauffeur’s techniques of 
listening, which, in the case of middle-class automobilists, was mediated and 
reflected in written instructions (Bourdieu 1990, 74). This explains some of the 
problems they encountered while making sense of what they had heard. However, 
it was a technological necessity for self-drivers to learn to recognize audible signs of 
malfunctions. Motorists embodied both listening practices as techniques du corps ; 
they engraved these techniques into their habitus. Displaying these techniques 
became a means of social orientation: They considered car repair as matching 
their upper-middle-class social standing. Car repair became a bourgeois cultural 
technique: While car driving in Germany was still an exclusive and expensive occu- 
pation, the associated technical expertise was a distinctive sign and a sign of distinc- 
tion (Wetterauer 2007, 155-66). Furthermore, motorists who acquired diagnostic 
capabilities could compénsate for their distrust of chauffeurs and auto mechanics. 
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Listening to Automobiles: 
The Mechanic’s Experience 


Despite great expectations after the war, mass motorization did not take off in 
Germany in the interwar period. The number of passenger cars rose from 60,876 in 
1914 to 216,300 in 1925. At that same time, there were 17,726,507 passenger cars in the 
United States; 778,211 in the United Kingdom; and 573,397 in France (Ruppel 1927, 
10, 15). 5 Still, the car-repair business saw strong growth in Germany, with some 
20,000 repair shops in business in 1929 (Reparatur-Werkstatt 1929, 1-2). This figure 
included a huge number of workshops run by blacksmiths, tinsmiths, and fitters, 
who repaired cars on the side, which is why Kevin Borg appropriately refers to this 
group of auto mechanics as ad-hoc mechanics (2007, 31-52). 

In contrast to other trades in Germany, the auto mechanics failed to be legally 
regulated, and it was regarded as socially legitímate for chauffeurs, retired army 
drivers, or craftsmen from a wide range of trades to repair cars. The blurred bound- 
aries of this field are reflected in journals such as Auto-Technik , which, in the first 
postwar decade, catered to car dealers, garage owners, mechanics, and automotive 
engineers. Renamed Automobiltechnische Zeitschrift in 1929, it became the landmark 
of the Germán automotive engineering profession. In 1928 a monothematic jour- 
nal, Die Reparatur-Werkstatt , renamed Krafthand in 1930, was initiated to support 
the ad-hoc mechanics, as well as specialized auto mechanics, with news from the 
trade, articles on automotive technology, and repair tips. 

Journals for Germán auto mechanics frequently ran articles on car-related 
sounds, noises, and listening. A special section in Auto-Technik titled a For the 
Repair Shop,” for example, gave advice on how to get rid of minor noises, such as 
the rattling of the brake linkage. More in-depth articles explained the technological 
background of new noiseless chain drives and engine knock (Ostwald 1921, 1922). In 
this context, authors discussed the knock resistance of antiknock fuels. They 
described engine knock as starting with occasional plinking and increasing up to 
loud and frequent detonations and claimed that, for the time being, in the absence 
of measuring devices, the best instrument to determine the antiknock quality of 
fuels remained a well-trained ear, mainly because of its great sensitivity to differ- 
ences in frequency (Enoch 1928). Interestingly, this ties in with Marcel Mauss’s 
notion of the body as the first and most natural technical means (1936) and under- 
scores the crucial significance of listening in the field of auto mechanics. 

As with motorists, mechanics relied heavily on their own listening skills to 
diagnose technical malfunctions. The article “How to Diagnose Malfunctions of 
Passenger Cars” explained different noises, their technical sources, and ways to 
repair them (S. 1928). Another contribution distinguished a multitude of abnormal 
sounds. The author categorized noises emanating from the engine, the drive train, 
the body, and the chassis. He gave details on how to diagnose every sound, he 
described particular driving conditions typical for this sound, and he advised the 
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disassembly of certain components to rule out other possible sound sources. As one 
anonymous source put it: “For diagnosing engine sounds a very fine sense of hear- 
ing is required indeed, especially when the sounds are very faint and if several 
sounds from different sources have to be distinguished simultaneously, which is 
often the case” (Anonymous 1932, 81). The author elaborated that experience was 
essential for “diagnostic listening” and stated that a lack of practice could not be 
compensated for by using a hearing device: “Those who are unaccustomed to the 
use of a wooden rod or a stethoscope, which are both put against the engine from 
the outside, will easily be misled by the effect of resonance” (Anonymous 1932, 81). 

Other authors were much more optimistic that hearing devices could assist the 
auto mechanic — especially when it was important to focus on definite spots. Auto- 
Anzeiger , a trade journal not distributed to ordinary motorists, presented a special 
stethoscope with two sensors: the Tektoskop and the Tektophon. This type of 
construction should enable the mechanic to examine two engine spots at once, 
thus allowing him to compare two sounds in great detail (Anonymous 1929b). 
Labeled ÍC the ideal troubleshooter,” the Meccano-Stethoskop was described as a 
brilliant tool: With the help of the stethoscope one could “clearly observe the 
processes inside the engine” and save the time by not having to disassemble the 
engine when diagnosing problems (Anonymous 1930). The Meccano-Stethoskop 
provides another indication of the general importance of listening in the auto 
mechanics trade. The claim that it could help to speed up the diagnosis appealed 



Figure 3.2 Photograph illustrating the use of a Tektoskop: Anón. 
“Wo entsteht das Geráusch?” Auto-Anzeiger 4(40) (1929b): 2-3. 





88 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


to workshops, as customers expected a prompt and exact estímate. However, 
because of the difficulty of making a precise sound diagnosis, Auto-Technik warned 
the auto mechanics among its readership to give only nonbinding estimates 
(Walkenhorst 1926). 

Auto-Technik' s warning about making estimates indicates two crucial intersec- 
tions of the practice of diagnostic listening: mapping and verbalization. The notion 
of sound mapping comes from the field of psychoacoustics. It refers to the linking of 
sounds to the information they represent (Fricke 2009, 55-56), and it is a useful 
metaphor for grasping the practice of diagnostic listening as the association of par- 
ticular car sounds with specific malfunctions. The concept of mapping helps to 
explain why stethoscopes and other hearing devices alone were of little help, as it is 
not sufficient simply to amplify sounds; one must assign meaning to them as well. 
But how would auto mechanics be able to learn to map sounds? Was it largely a 
matter of theoretical knowledge? Or was bodily knowledge the key to diagnostic 
listening? 

As in the case of motorists’ listening practice, the literature they had available 
agreed on the precondition of theoretical knowledge and the status of practical 
experience (Anonymous 1928; O. Winkler 1928). Tables, lists, and fault trees were 
published to foster methodological investigation of audible defects (Fischer 1927). 
Regular sections in auto mechanics trade journals, such as “From the Workshop 
Practice” (Auto-Technik) and “Do You Already Know?” (Die Reparatur-Werkstatt), 
emphasized the significance of practical knowledge. Regardless of whether the árd- 
eles had a more theoretical or practical orientation, however, both journals strug- 
gled to put the audible indications of different malfunctions into words even though 
such codification of car sounds was a prerequisite for the written transmission of 
the diagnostic listening skill (chapter 4). Engine knock is a good example to ¡Ilús- 
trate this. An árdele specifically devoted to the hammering and knocking of the 
engine described engine knock as a hammering sound, whereas only the audible 
malfunction of a pistón bearing or rod was called knocking (Anonymous 1919). 
Another 1932 árdele distinguished no fewer than seven types of knocking, including 
“metallic knocking,” tc high knocking,” “damped knocking,” and a “muffled clang” 
(Anonymous 1932). Apparently, a standardized set of subtly nuanced terms to ver- 
balize the audible characteristics of malfunctions was not available. Nonetheless, 
how should the mechanic in his everyday practice know exactly what this author’s 
variously described knocking sounds referred to? Similarly, it was crucial to distin- 
guish between these sounds, as the “metallic” one was just a nuisance, but the “high” 
one indicated a serious problem. 

This raised the question of whether written advice on how to listen to malfunc- 
tions was of any use to auto mechanics. Despite his own work, Eugen Mayer-Sidd 
(1931b), a regular contributor to Krafthand , questioned the use of written repair 
accounts: “[I]t is exceptionally difficult to give someone else a detailed and grasp- 
able description of a technical work or method that enables him to do it himself 
later on.” Especially with regard to malfunctions he was very pessimistic: “[I]t is 
even more difficult to describe a malfunction so comprehensibly that someone else, 
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without physical inspection, may give the appropriate advice with certainty on how 
to repair it” (Mayer-Sidd 1931b). The unachieved codification of car sounds and the 
general difficulties of giving written advice underscore the relevance of the tacit 
dimensión of car repair knowledge (Polanyi 1958, 69-245). If written accounts 
largely failed to convey “diagnostic listening” skills, tacit knowledge could be passed 
on only through apprenticeship. 

It is safe to argüe that the motorists’ techniques of listening and the auto 
mechanics’ practice of diagnostic listening were in fact quite similar. This is perhaps 
hardly surprising because specialized auto mechanics shared their knowledge of 
their field with many inexperienced craftsmen from other trades, former chauf- 
feurs, and other ad-hoc mechanics. Until the mid-i920s, they competed as more or 
less competent and legitímate rivals, and all of them might gain advanced expertise 
in diagnosing and repairing passenger cars. If theoretical knowledge generally 
served as a necessary basis, only through practical experience could one really 
develop diagnostic skills. All of these individuáis recognized car sounds as an indis- 
pensable source of information on the engine’s condition and as a means of locat- 
ing particular malfunctions. The ability to diagnose by listening had to be 
incorporated as an embodied technique, and the mechanics’ literature emphasized 
the considerable effort this took. Only a practically trained auto mechanic could 
diagnose a malfunction by listening to it, but even then pardal disassembling was 
still regarded as advisable. 


The “Repair Chaos” and the 

FoRMATION OF THE AUTO 

Mechanics Dispositive 


In December 1926 the Germán Chambers of Industry and Commerce published an 
assessment of the situation in the field of auto mechanics. The study summarized a 
series of complaints about excessive prices for spare parts and repair work. In 
response, the car dealer association denied these shortcomings, but the discussion 
about cheap and reliable repairs did not die down (Anonymous 1926). Several let- 
ters published in the Allgemeine Automobil-Zeitung articulated the rising distrust 
and dissatisfaction among car owners (Briefkasten 1928a, 1928b). 

In 1928 Auto-Technik published an editorial titled “The Great Repair Misery.” 
Apparently, its author considered it his duty to raise this delicate issue, and he even 
felt obliged to criticize parties that paid for ads in this same journal. He leveled 
criticism mainly at auto mechanics and their expertise and trustworthiness, includ- 
ing those employed by car dealers and manufacturers, as well as independent repair 
shops. During the warranty period, for example, the significance of audible techni- 
cal problems was often played down if not altogether denied by the manufacturer’s 
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mechanics: “[T]hey try to persuade the customer that the abnormal sound, which 
indicates an upcoming problem, is of no significance — This does not mean any- 
thing,’ c this is just an imperfection’ ” (Loewe 1928, 11). In other cases motorists 
would bring in a car that obviously had an audible problem, but after the repair the 
difficulty continued. The author of the editorial interpreted this as a sign of the lack 
of the necessary expertise. In sum, he argued that the repair misery posed a serious 
threat to the whole automobile system because dissatisfied motorists might simply 
abandon their cars. 

A corresponding editorial in the Állgemeine Automobil-Zeitung articulated a the 
outcry of the automobilist” and asked readers to propose a way out of the “repair 
chaos.” As evidence, the article mentioned the numerous complaints published in 
the journal’s “letter box.” Motorists, it claimed, preferred to carry out major repairs 
themselves because of their distrust of auto mechanics (Anonymous 1931b). Corning 
from the other end, advocates of the auto repair business saw the origin of the 
“repair chaos” in the unregulated access to the trade: “The blacksmith, the bicycle 
or sewing machine mechanic, the fitter, they all have to learn their trade for four 
years, but anybody who learned to handle a file and followed a six-week course is 
able to repair the complicated and valuable engine of a car flawlessly?!” (Testor 
1931). To put things right, they proposed the establishment of an independent auto 
mechanics trade together with mandatory membership in a guild (Anonymous 
1931a). 

In this respect it is relevant to briefly discuss the overall Germán professional 
trade system. Importan tly, the laws governing trade and industry in Germany had 
been amended in 1897 and 1908. From then on, the right to enter a trade was 
restricted. To practice a trade, a three- to four-year formal apprenticeship was 
required. The passing of a final exam, which earned one a journeyman’s certifícate 
(Gesellenbrief), served as one’s entry ticket to a particular trade. After accumulating 
experience for three to five years, depending on regional customs, journeymen 
achieved the right to take a second exam to obtain a master craftsman’s certifícate 
(Meisterbrief). Only with this second certifícate did one have the right to train 
apprentices. Moreover, every workshop had to become a member of a trade 
guild. Because this was mandatory, the system led to a high degree of organization 
in the trades (Greinert 1994). The journeyman’s certifícate was not only a form of 
institutionalized cultural capital that regulated and restricted access to one’s field; it 
also allowed Germán craftsmen to take up a position of trust, a kind of symbolic 
capital, which structured their social relationships with nonmembers in the field 
regardless of individual skills. Germán craftsmen thus cultivated a particular 
preindustrial mentality, a habitus grounded in a long and painstaking apprentice- 
ship, whereby ultimately a sense of “master craftsman’s honor” and the ideology 
of “high-quality workmanship” served as guiding valúes (Holtwick 1999; Sennett 
2008). 

The craftsman’s position of trust was also part of the discussion in the auto 
mechanics’ community, sparked as it was by complaints about the “repair chaos.” 




91 


“SOBBING, WHINING, RUMBLING” 


Thereby the “physician” played a major role. It was claimed that during their 
apprenticeship, future auto mechanics would develop — under the strict guidance 
of a master craftsman — a distinct sense of responsibility. Just like a physician, the 
mechanic would embody the necessary expertise, as well as commitment to good 
practice. The advocates of an independent trade of auto mechanics argued that 
limited repair courses, as offered in the United States (Borg 2007), were insufficient 
to instill this particular sense of responsibility (Schiff 1930). 

The auto mechanics’ fight for an independent profession was further embed- 
ded in the political action of representatives of the trade organizations to defend 
their members’ social standing and economic position. They argued that the “high- 
quality workmanship” guaran teed by the craft trades was a public good that deserved 
special protection through strict regulation of the field. Andrew Abbott has described 
how such public claims of jurisdiction are often used to enforce the specific inter- 
ests of a profession (Abbott 1988, 59-85). He has also emphasized the role of legal 
regulations, “which can confer formal control of work” (Abbott 1988, 59). Despite 
several initiatives, the craft trades failed to put through their program during the 
Weimar period, but the demands were taken up by the National Socialists. After 
they carne to power, they enacted new legislation in 1934 that introduced obligatory 
guilds for all recognized trades. Furthermore, in 1935 an amendment abolished the 
right to practice a trade, establishing the master craftsman’s certifícate (grosser 
Befáhigungsnachweis) as a precondition for starting a workshop (H. Winkler 1972, 
184-85; Saldern 1979). The first to profit from the new legislative framework were 
auto mechanics. This is hardly surprising as they maintained good relations with 
the National Socialist Motor Corps (NSKK). Furthermore, the discourse about the 
future of the auto mechanics trade meshed well with the National Socialists’ plans 
for mass motorization in Germany (Zeller 2007). Friedrich Stupp, member of the 
Nazi Party and NSKK-Obersturmführer, was elected first president of the Berlin 
auto mechanics guild in September 1933, and only a few years later 80 percent of the 
auto mechanics master craftsmen were members of the NSKK (Hochstetter 2005, 
115-16). 

In response to the urgency of the repair crisis, different groups of actors engaged 
in reorganizing the auto mechanics trade. 6 This led to the formation of the car 
mechanics guild, a new corporate body with the symbolic power to approve car 
repair experts (Bourdieu 1989, 23). The newly established system of obligatory trade 
guilds, apprenticeships, and workshops led by master craftsmen can be understood 
as the formation of an auto mechanics dispositive — as the readjustment of power 
relations within the field of auto mechanics. The formation was determined by the 
structure of the Germán system of trades: its common set of legal restrictions, insti- 
tutions, cultural valúes, social relations, and practices. The societal conditioning 
through the tradition of this system resolved the crisis of confidence between 
motorists and auto mechanics by guaranteeing the trustworthiness of the latter. 
Henceforth, auto mechanics received theoretical training in vocational schools, and 
they embodied the necessary techniques, as well as a distinct habitus, during their 
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four-year apprenticeship. The new cultural capital of the journeymaiTs certifícate 
not only regulated access to the trade but also equipped auto mechanics with an 
unquestionable expertise: The truth valué of that expertise was independent of the 
individual skills of a single mechanic as it was guaranteed by the “honor” of the 
craft professions. Foucault has described how the encounters between individual 
actors become structured by the power relation inscribed in the dispositive 
(1976, 120). 

In comparison, in the United States the status of auto mechanics as trust- 
worthy technical experts remained contested due to the unregulated access to 
the trade. As a possible solution, the auto mechanics sought to delegate diagnos- 
tic authority to instruments and measuring devices — but failed. Also, the intro- 
duction of a flat-rate system did not solve the problem of distrust because 
mechanics started to work hastily and thus did a shoddy job under the pressure of 
standardized work times (Mclntyre 2000, 292). In contrast to their Germán coun- 
terparts, American auto mechanics have suffered an endless crisis of confidence 
(Borg 2007). 


The Ideal of the Silent Car 


Before describing the differentiation of listening practices attributable to the for- 
mation of the auto mechanics dispositive, I first consider two strands of discourse 
on the “silent car” that are closely connected to the contemporary automotive tech- 
nology: noise as an audible sign of technological inefficiency and silence as a cul- 
tural expression of modernity and distinction. 

After 1900, engineers showed increasing interest in reducing the noise of pro- 
duction equipment. As Karin Bijsterveld has shown, “noisy machinery” became an 
“indication of mechanical inefficiency” (Bijsterveld 2006, 328-29). This special dis- 
course entered the automotive engineering community, too: Research on noiseless 
car components, especially noiseless sprocket chains, pertained to theoretical con- 
cerns, as well as the production process. Chain drives had the advantage of being 
quieter than spur wheels. Corresponding articles in trade journals described 
sprocket chains as efficient and modern (M. 1919). 

Advertisements for “silent car components” in consumer magazines clearly 
show the circulation and significance of the special discourse beyond the field of 
engineering. Manufacturen informed automobilists that Fichtel & Sachs roller 
bearings “reduced fuel and lubricant consumption” and “guaranteed the silent run 
of the engine” — or that ZF gearwheels were the “wheels of choice for the silent gear 
box” (Fichtel 8c Sachs 1928; ZF 1930). Science journalist Walter Ostwald (1921, 11) 
commented on the “silent car” craze: “Noiselessness is more than a fashion. A noise 
always points to a waste of energy. Therefore, the longing for noiselessness, if 
fashionable, is at least a good and useful fashion.” 
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Figure 3.3 Detail from advertisement for silent-running pistons: “Running smoothness 

matters!” Allgemeine Automobil-Zeitung 41(49) (1940): 839. 


Furthermore, the advertising of car manufacturers promoted noiselessness 
as a distinguishing mark of modern and elegant passenger cars. The Citroén- 
Phaeton stood out with its “noiseless and flexible working engine” (Citroen 
1928), Brennabor’s Juwel had an “inaudible engine full of adaptable power” 
(Brennabor 1929), and a positive feature of the Primus was its noiselessness, achieved 
by the “generous use of rubber for sound insulation” (Adler 1932). Articles 
offered advice to motorists and auto mechanics on how to get rid of squeaks and 
squawks (Ostwald 1923). These little noises were not just a nuisance but also an 
embarrassment to the car owner. An ad for suspensión covers carne with an illus- 
tration of a young driver whose car’s suspensions screeched so loudly that his 
spouse, as the accompanying poem clarified, preferred to take the train back home 
(Vogelsang 1937). 

“Noiselessness was [not only] the mark of a good car” (K. 1927) but also a sign 
of good manners and distinction as contemporary antinoise advocates stressed 
(Bijsterveld 2008). Significantly, the propaganda of noise-abatement campaigns, 
class habitus, and engineering discourse converged in this set of valúes. Mechanical 
efficiency and social distinction urged car owners to listen to their cars even more 
carefully and to complain about each tiny noise. However, as we will see later, auto- 
mobilists who insisted on the promised “silent run” of their machines were often 
denounced as “noise fanatics.” In the eyes of the mechanics these drivers took the 
advertisements far too seriously. 
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Differentiation of Listening 

Practices 


From 1934 onward, the expertise of Germán auto mechanics was basically guaran- 
teed through their obligatory four-year apprenticeship. During this period the 
future mechanic worked in very cióse contact with a master craftsman or a journey- 
man to learn by observing and imitating. To ensure the quality of their bond, a 
master craftsman was not allowed to train more than two apprentices, while a jour- 
neyman could supervise only one apprentice. During the first six months, appren- 
tices only lent a hand to their mentors, after which they continued with simple tasks 
like lubricating or checking the tire pressure. In the second year, they learned to 
diagnose uncomplicated malfunctions, but they were not allowed to carry out 
maintenance work themselves until the third year of their apprenticeship. In addi- 
tion, apprentices had to learn a wide range of metalworking skills, such as filing, 
drilling, milling, turning, and welding. This set of manual skills emphasizes the 
status of bodily knowledge in the auto mechanics trade (Kümmet 1941; Zogbaum 
1937 ). 

Training manuals did not explicitly mention specific auto mechanics’ skills, 
such as diagnostic listening. Apparently, such skills were considered part of the 
practical learning process — based on imitation of the master craftsman. Inherent in 
the notion of the apprenticeship system was the substitution of codified knowledge 
through the practical skills apprentices would pick up in their years of practice: 
They learned by doing. Douglas Harper has shown how car repair knowledge is 
informally passed on in this manner (1987, 24-31). It involves a practical repertoire 
of imitated actions rather than models, a process of acquisition Bourdieu calis 
mimeticism : “Bodily hexis speaks directly to motor function, in the form of a pat- 
tern of postures that is both individual and systematic, being bound up with a whole 
system of objects, and charged with special meanings and valúes” (Bourdieu 1990, 
74). In a similar way, Harry Collins speaks of the “unconscious emulation” of 
uncognizable knowledge (Collins 2001, 72). 

During the formative period of the auto mechanics dispositive, individuáis 
questioned the position of car owners as knowledgeable amateur mechanics. Under 
the programmatic title “Hands Off,” in May 1933, the readers of the Allgemeine 
Automobil-Zeitung were urgently requested not to repair their cars themselves 
because they lacked the necessary abilities. Furthermore, drivers were instructed to 
ignore little noises that are just a nuisance. Only if they heard a “threatening noise” 
(Gefahrengeráusch) should they bring their car to a specialized repair shop 
(Anonymous 1933). Both articles supported the professionalization of the auto 
mechanics trade. As such they glorified the “high-quality workmanship” and 
devalued the knowledge of all nonexpert mechanics. 

Concomitantly with the stabilization of the dispositive through the 1934 
legislation, the new picture of the lay motorist became clearer. As one author put it, 
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“The rising spread of automobiles goes hand in hand with the diminishing number 
of knowledgeable motorists” (Rdl. 1936). One cause was seen in the technical level 
reached in the automotive industry, which made cars more and more reliable, 
but more important, it was assumed that most people were just “terribly clumsy” 
(Rdl. 1936). The shifting balance between motorists and auto mechanics can further 
be observed in the rhetorical shift from reparieren (repairing) to basteln (tinkering). 
This semantic turn excluded automobilists from the discourse on automobile 
repairs. Furthermore, it sustained the new boundary between expert mechanics and 
lay drivers along the line of technological knowledge (Franz 2005). Automobilists’ 
journals no longer published articles on how to acquire diagnostic skills or pro- 
vided real repair instructions. Instead, they advised the automobile enthusiasts on 
how to tinker with their cars (Rdl. 1938). 

On the other hand, letters that ended up in the Briefkasten section suggest that 
motorists changed their practices only gradually. In the latter part of the 1930S they 
still listened closely to their cars, sent in their self-diagnoses, and asked for advice 
on how to tackle the problems. Flowever, in contrast to the situation several years 
before, when readers were advised to “listen in” themselves, they were now urged to 
consult expert mechanics. In addition, the automobile club journal published lists 
of reliable workshops and informed its readers that most of their trouble with 
mechanics could be traced back to the readers themselves: They were blamed for 
having unrealistic ideas about prices and repair times (Dill 1936). Another article 
argued that motorists should learn not to bother auto mechanics with their self- 
diagnoses: “If your car is more important to you than your rhetorical exercises, let 
the master craftsman do the job” (W. 1938). 

In the late 1930S an interesting discussion aróse about the right of motorists to 
observe the repair process inside the workshop. The Allgemeine Automobil-Zeitung 
argued that, in particular cases, such as the repair of a rare make, a driver could 
poten tially provide help to the mechanic because of his own special knowledge of 
the car (Peter 1938). In a direct response, Krafthand , the trade’s official journal, 
dismissed this as an offensive intrusión into the mechanics’ backstage domain — 
and as a blunt articulation of distrust (Goffman 1939, 111-21). The journal advised 
its readers to follow a strict policy of trust: “The motorist who distrusts a workshop 
should look for another one. One does not go to a physician whom one does not 
trust, either” (Anonymous 1940). The car mechanics’ exclusive access to the actual 
workshop became part of the dispositive. The new official master craftsman’s hand- 
book proposed a ground plan for garages that separated motorists and mechanics 
(Kümmet 1939, 250): The showroom and a special waiting room for customers 
served as the front domain, where the master craftsman met his clients, whereas the 
backstage of the workshop was for mechanics only. In this way the establishment of 
a trusting relationship between mechanics and motorists simultaneously gave rise 
to new barriers. 

The discourse on lay motorists illustrates, moreover, how the trade used this 
discussion to demárcate its new boundaries. The cartoon series “Kunibald, the 
Smart Customer” portrayed the “dull” lay driver. Kunibald is a motorist who tries 
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to diagnose and repair his car on his own, and he often disregards his mechanic’s 
advice, but in the end he always fails and must admit that he should have taken his 
car to the garage (Jonny 1937). Another fictional story describes a neurotic motorist 
who always does everything himself, but, like Kunibald, he finally ruins his car and 
needs the help of a professional mechanic (Windecker 1937). I suggest a reading of 
these narratives as stereotypes in which the emerging auto mechanic’s habitus grad- 
ually became visible. They exemplified the “social viewpoint” of an average auto 
mechanic, and at the same time the “shop talk” structured future encounters 
between mechanics and motorists by suggesting the proper front-stage behavior 
(Goffman 1959, 175-76). 

As mentioned earlier, the struggle for listening expertise culminated in com- 
plaints from mechanics about “noise fanatics”: “As you know, there are so-called 
noise fanatics who can drive a busy master craftsman crazy with their accounts of, 
sometimes real and sometimes imagined, noises they heard” (Anonymous 1938). 
Another árdele elaborated: “He is often bothered by noises that exist only in his 
imagination. The fact that he is always sure where the noise comes from does not 
make him any more likeable because mostly he is wrong, thus leading the craftsman 
down the wrong road” (Anonymous 1939). With the topos of the “noise fanatic,” 
auto mechanics reclaimed the practice of diagnostic listening as their exclusive 
domain: “When looking for a noise source, never ever let yourself be influenced by 
the customer” (Anonymous 1939). By denouncing them as overanxious and 
unknowing, mechanics deprived motorists of their listening expertise. This was not 
a matter of whether or not they actually lacked such expertise; rather, it followed 
from their newly gained status as nonexperts. As a result, the motorists’ practice of 
diagnostic listening was reconfigured as ¡Ilegitímate. 

At the same time, the narrations about “noise fanatics” demónstrate why ordi- 
nary drivers, though expelled from the realm of diagnostic listening, should never- 
theless listen to their cars. One árdele noted: “The expert’s trained ear knows the 
sound of the engine; he distinguishes between healthy sounds and noises that indi- 
cate an upeoming problem. In contrast, the layman is often anxious about harmless 
noises. But it is still better for him to consult an expert in vain than to disregard 
noises until the engine has a serious problem” (Anonymous 1936). The latter type 
of driver was also called the “noise phlegmatic” (chapter 2). Another author 
explained: “He is not worried when his car rattles and squeaks at all ends and when 
the chassis together with the engine plays a free concert. . . . Because the noise 
phlegmatic has a tin ear, this symphony does not disturb him; he will not do any- 
thing until his heap breaks down. We find these people just as disagreeable” 
(Anonymous 1939). Folio wing this rhetorical confrontation, motorists should on 
no account “listen to diagnose” malfunctions, but they should always “listen while 
driving” to recognize technical problems in time. In other words, the “sonic contra- 
diction” urged automobilists to listen carefully but not to listen thoroughly. 

The ban of prívate car use in Germany in February 1942 temporarily suspended 
the struggle between motorists and car mechanics — at a point where the latter 
had gained the upper hand. After the war, journals for automobilists, such as 
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ADAC Motorwelt , predecessor of the Allgemeine Automobil-Zeitung , and Auto 
Revue , revived some of their oíd features and columns. For example, the “travel 
box” section advised readers on their travel plans (Anonymous 1950a). Nevertheless, 
the technology-oriented Briefkasten section was not taken up again in the 1950S. 
Ñor did these journals continué to publish detailed instructions on how to diagnose 
and repair malfunctions as they had done in the interwar period. Apparently they 
no longer recognized readers as interested and competent in these matters: 
Regardless of individual listening techniques, the motorists’ practice of diagnostic 
listening was apparently dismissed as irrelevant or inappropriate. On the other 
hand, trade journals for auto mechanics continued to publish articles on how to 
lócate audible malfunctions. This underscored the role of listening as a diagnostic 
technique (Anonymous 1955). As indicated in the introduction, trade advocates 
also continued their polemics against ordinary motorists, but they were no longer 
the subject of editorials. Instead, these narrations were taken up in cartoons and 
small columns on “real” and instructive incidents. Thus, they became part of a 
folkloristic reassurance of the auto mechanics’ own exclusive expertise. The craft 
professions habitus barred motorists from car diagnostics and contested even their 
monitory listening skills — without denying them the latter practice. 


CONCLUSION 


As I argüe in this chapter, listening to automobiles was a common technique among 
motorists and auto mechanics during the 1920S. At the end of the decade, however, 
the repair crisis posed a threat to the growth of the automobile system, and in 
response to this urgency different groups argued in favor of a rigid organization of 
the trade. This strategic movement led to the formation of an auto mechanics dis- 
positive that altered the balance of power between motorists and auto mechanics. 
The new social logic devalued the knowledge of ordinary automobilists, which can 
be read as an adjustment to the new dispositive (Foucault 1978, 121). In the new 
situation, motorists listened while driving to recognize technical problems in time, 
and mechanics listened to thoroughly diagnose these malfunctions. The genealogy 
of listening practices shows that the claim for expertise in diagnostic listening was 
merely certified by the auto mechanics’ position of power (Foucault 1976, 120). So, 
rather than personal skills, their habitus decided on the truth valué of their knowl- 
edge, and concomitantly the motorists’ habitus was altered by the submission to the 
doxa of the field, which limited their scope of action accordingly. 

NOTES 


1 My translation. I have also translated all of the sources cited in this chapter that were 
not available in English. 
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2 For the distinction between different modes of listening (monitory listening, 
diagnostic listening, exploratory and synthetic listening), see Bijsterveld (2009). 

3 Bourdieu uses the term hexis instead of technique du corps. 

4 In the early postwar years, for example, the same articles appeared in periodicals for 
mechanical engineers and ordinary motorists (Ostwald 1921). 

5 This amounted to 290 persons per car in Germany, 6 per car in the United States, 60 
in the UK, and 71 in France. 

6 For Foucault a societal crisis, urgency , is the trigger for the formation of a 
dispositive, see Foucault 1978, 120; Bührmann and Schneider 2008, 53, 61. 
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SELLING SOUND: 

TESTING, 
DESIGNING, AND 
MARKETING SOUND 
IN THE EUROPEAN 
CAR INDUSTRY 


EEFJE CLEOPHAS AND KARIN 

B I JSTERVELD 


Introduction 


Since the late 1990S, leading automobile manufacturers have advertised the sonic 
qualities and interior tranquility of their vehicles with increasing fervor. The new 
owners of a Chrysler Voyager will find a “whispering stillness” waiting for them. 
The Jaguar S-type is propelled by a “silent forcé,” while in a Toyota Avensis “even 
the silence comes standard.” Amid the bombardment of noises our ears have to put 
up with, we will experience “tranquility” in a BMW diesel, Mercedes-Benz, and 
Volkswagen Passat. Better still in the Ford Focus, everything we “touch” comes 
with “the sound it is supposed to make.” 1 

Clearly, these advertisements express automakers 5 growing confidence in 
their ability to enhance the overall quality of car driving with respect to its various 
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auditory dimensions. Indeed, manufacturers have invested considerable time 
and money in making sure that switches, warning signáis, direction indicators, 
windshield wipers, the opening of car Windows, the locking of car doors, or the 
crackle of the leather upholstery come with the right sound. Such sounds, or “target 
sounds,” should express their make’s identity and attract particular groups of 
consumers. Two simple figures demónstrate just how much today’s automotive 
industry invests in its producís’ sonic characteristics: For instance, BMW employs 
more than 150 acoustical engineers, and Ford has an acoustical department of 
200 employees (Jackson 2003, 106; Van de Weijer 2007, 9; RH, 20). 

It appears to be far from easy, however, to define the kind of interior sounds 
specific groups of consumers prefer. It turns out that many customers, when acting 
as test subjects, find it hard to articúlate their priorities in the area of sound, while 
experts diverge significantly over the best methods for measuring and evaluating 
sound. No wonder, then, that a distinct field of research has emerged, one that has 
cióse ties with the car industry and aims to find “the holy grail of psycho-acoustics” 
(Vetter 2004). Vehicle simulators and specific techniques designed to make test 
subjects express what they find difficult to articúlate — how they think about 
sound — are center stage in the search for a high sonic quality and the most 
attractive target sounds. 

Our chapter focuses on the rise of a new tradition of testing car sound in the 
European automotive industry in the 1990S. We explore three issues: FIow do ways 
of defining the “reality” of sound perception and the differences between expert 
and lay listeners affect the dynamics of testing in car manufacturing? Why does 
extensive testing of car sound not automatically result in the design of new target 
sounds — sounds, such as “sportive” ones, for specific groups of consumers? And 
where does this increasing significance of sound design in the consumer industry 
come from in the first place? 

We address the first two questions by drawing on sound studies and Science 
and technology studies (STS). While the field of sound studies helps us to unravel 
the early history of car sound and its cultural meanings, as well as to understand the 
complex phenomenon of how sounds are talked about, STS provides theories on 
testing and simulation. To answer our third question, we capitalize on the notion of 
Erlebnisgesellschafty or an experience-driven society: a society that constantly flags 
the significance of the immediate and subjective inner experience of its members 
(Schulze 1992). Together, these approaches clarify how new sense-oriented ways of 
marketing and designing cars have prompted new ways of testing. We go further, 
however, and also show that a mismatch can occur between testing and design of 
consumer producís especially at moments of transition in marketing priorities. In 
other words, our case does not just exemplify the recently increased sonic sensitiv- 
ity in design settings; it also provides insight into the consequences of this sonic 
sensitivity for traditions of testability. 

Empirically, our chapter is based upon semistructured qualitative interviews 
with twenty-two participants in the European automotive industry, European 
research companies, projects devoted to the study of car sound perception, and 
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organizations that focus on either the standardization of vehicle sound measure- 
ment or sensory branding (see table 4.1). 2 We have interviewed staff members from 
the acoustical engineering departments of Ford, Opel, Renault, BMW, and Porsche. 
Most of our interviewees were formerly involved in OBELICS, an EU-funded proj- 
ect on car sound perception and design, coordinated by HEAD acoustics GmbH 
(hereafter: HEAD acoustics), a firm in acoustic Consulting. Those not affiliated with 
OBELICS were all experts working on car sound testing, design, and marketing. In 
addition, our analysis relies on field notes from the observation of testing sites, 
documents produced in the context of OBELICS, and publications by acousticians 
and the automotive industry on car sound design. 


A Short History of Sounds and 
the Automobile Industry 


In the early days of motoring, the loud noise of gasoline cars largely had a positive 
connotation. The controlled explosions of the late nineteenth-century internal 
combustión engine generated far more noise than the steam-driven and electric 
cars still around at that time (Volti 2004, 7-8). However, for the men who 
drove a gasoline car in those days — such as wealthy engineers, physicians, and 
businessmen — the noise was hardly a problem. On the contrary, noise stood for 
power, and it allowed one to impress bystanders (McShane 1994, 169; Mom 
1997, 475). Only in specific contexts did the noise of the gasoline car prove to be a 
concern. In the United States, for instance, it was not deemed acceptable to use cars 
in funeral ceremonies before the 1930S (Berger 1979, 183). 

From the 1920S onward, manufacturers of cars and car parts started advertising 
the “silence” of their cars’ engines, exhausts, Steel bodies, chains, brakes, and gears. 
This was tied in part to the fact that in the preceding two decades, engineers 
had begun stressing that mechanical friction and noise were two sides of the same 
coin and that noise reduction implied greater motor efficiency and longer engine 
lifespan (Dembe 1996, 195). “Inside of the smoothly gliding mechanism of the 
motor car,” a Hoover Steel Balls advertisement said in 1920, “is the well-poised 
ball bearing insuring locomotion luxury that is free from sounds which offend 
the hearing and shorten the life of the unit.” 3 “As friction and wear vanish,” 
manufacturer Hyatt claimed about its roller bearings, “so does noise. Cars, trucks, 
and buses which opérate quietly roll more easily, handle more lightly, perform more 
economically.” 4 

Early advertisements strived to show in addition that nothing should 
distract a customer from buying a particular car. The implication was that users 
should not be annoyed by extraneous rattles, creaks, and bleeps. This argument 
gained increasing credibility after the economic crisis of the late 1920S hit the 
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automotive industry. Manufacturéis were desperate to sell vehicles, and any 
marginal advantage was welcome. This gave rise to a new form of sound craft 
(Krebs, this volume). Western Felt Works secured “composite body silence” 
by eliminating “squeaks at metal to metal and wood to metal contacts.” 5 Budd’s 
“one-piece body” promised “permanent quiet.” As Budd explained: “Every man 
who sells or Services automobiles knows the grief that can be caused by body noises. 
‘Squeaks and squawks’ run free Service costs way, way up. They’re deadly to the new 
owner’s enthusiasm for his car. They’re responsible for many a switch to another 
make of car” 6 (figure 4.1). 


45 


To ihe man who is lired of listening 
ío squawks aboui squeaks 


Every man who sells or Services auíomo- 
biles knows Ihe grief that can be caused 
by body noises. 

"Squeaks and squawks" run free Ser- 
vice costs way, way up. They're deadly 
to the new owner's enthusiasm for his car. 
They're responsible for many a swiich to 
another make of car. 

• • • • • 

The auiomobile body that has no joints 
just can'i squeak or raüle. By eliminating 
parís and by the extensivo use of flash 
welding, Budd eliminates joints. The Budd 
All-Steel one-piece body is made from 
only eight major parís — flash-welded 
inio one single jointless unit. 

This construction was originated and 
perfecied by Budd engineers. No other 
iype of construction can equal it for quiet 
— permanent quiet. 

The silence of the Budd one-piece body 
enables dealers to eliminóte their free Ser- 
vice costs on body noises. More ihan that, 
its quietness is an importan! facior in selling 
cars — and in helping to keep them sold. 



* DkJUilmw DI DULfU 

Originalors of ihe All-Sieel Body. Supplied io Manufacturera in ihe Unlied Siaies, Great Briiain. France and Germany 

Automotive Industries Januarv 2, 1932 


Figure 4.1. BudcTs advertisement: Automotive Industries (Jan. 2, 1932): 45. 
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In the latter half of the 1930S, increasing protests against urban trafile noise 
were another major factor in the growing concern with car sounds. They reached 
such intensity that producing quieter cars became a goal in its own right. In many 
parts of Europe, it became mandatory to equip exhaust pipes with mufflers, and 
even the máximum number of decibels produced by car sounds and horns was 
regulated (Bijsterveld 2008). Car manufacturers responded to these antinoise 
campaigns by presenting their cars as silent. In a booklet accompanying the 
British Noise Abatement Exhibition of 1934, Ford Motor Company advertised its 
new Ford V-8 with a text that perfectly expresses the three themes mentioned ear- 
lier: the efficiency of silence, the attraction of luxurious sound, and the need for 
less noise: 

Oft, in the Stilly Night, You see the lights of an obviously large, luxurious car. 

You hear nothing but the whisper of its generously dimensioned tyres, rolling 
over the road. Notable for its Engine and Transmission Silence, Smoothness 
of Progress, Freedom from Vibration, Altogether and in Every Way the 
Embodiment of Efficiency, as Well as of Surprising Economy, the Car of 
Tomorrow for the Lover of Noiselessness is THE NEW FORD V-8. (Noise 
Abatement Exhibition 1935, 69) 

The same booklet carried advertisements for “silent” vacuum cleaners, typewriters, 
and water toilets, but cars were among the first objeets that were subjected to 
noise-abatement regulation. 

This early noise-abatement regulation gave rise to the first research on the 
perception of car sound. In 1929, the National Physical Laboratory (NPL) in London 
tried to find criteria for the “strideney” of motor horns by order of the British 
government. By measuring the loudness, acoustical pressures, and average wave- 
form of the sounds produced by a wide variety of motor horns, as well as by having 
these sounds evaluated by listeners, it became clear that it was largely loudness that 
accounted for strideney. 

In the subsequent years, the NPL established a tradition of jury testing for the 
evaluation of sound. In these tests, members of listening juries — ranging from 
dozens of participants to many hundred — would judge the noise of horns, motor 
vehicles, or aircraft outdoors (Bijsterveld 2008, 204-207). In the early 1960S, 
D. W. Robinson, one of the leading acoustical experts at NPL, claimed that such 
jury tests offered a perfect “middle way” between laboratory studies and social sur- 
veys, between preciseness and validity, and “some compromise between scientific 
rigour and realism.” Outside, untrained listeners had no trouble rating the sounds 
of actual motor vehicles in terms of loudness, but when the sounds were recorded 
and played back in a sound-absorbent room, “our judges did not seem to be able to 
make up their minds at all.” Moreover, Robinson was not sure whether “unpleas- 
antness” could be tested in the same way as “loudness.” One problem was to choose 
the right words. The “inelegant” words disturbingness and bothersomeness might be 
synonymous to one person yet dissimilar to another. And would “ lastigkeif in 
Switzerland mean the same thing as “noisiness” in England? 7 Robinson’s remarks 
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went not just to the heart of the verbalization problem in evaluating sound; they 
also went to the heart of the very phenomenon of testing. 


Testing in Science and 
Technology Studies 


Since the early 1980S, scholars such as Edward Constant (1980, 1983), Donald 
MacKenzie (1989), Walter Vincenti (1990), and Trevor Pinch (1993) have identified 
testing as a highly significant research area for inquiry in Science and technology 
studies. Because the engineers doing the testing consider test data as “a final check 
on whether the expert’s conception of reality conforms to the physical world at 
hand,” there is really something “at stake” in testing (Pinch 1993, 26). 

All kinds of testing follow similar patterns. For instance, a highly significant 
property of testing is projection. Pinch clarifies this concept as follows: “If a scale 
model of a Boeing 747 airfoil performs satisfactorily in a wind tunnel, we can proj- 
ect that the wing of a Boeing 747 will perform satisfactorily in actual flight.” Such an 
“act of projection” from the small to the large and firom the present to the future 
“depends crucially upon the establishment of a similarity relationship. It is assumed 
that the State of affairs pertaining to the test case is similar in crucial respects to the 
State of affairs pertaining to the actual operation of the technology.” Disputes about 
tests often resolve around the issue of whether a particular test can be judged as suf- 
ficiently similar to the actual situation or is deemed to be significantly different 
from it. Moreover, Pinch emphasizes that such judgments of similarity and differ- 
ence “always rest within a broader framework of commitments and assumptions 
about how a technology will opérate,” adding that valid tests results “depend upon 
the acceptance of a similarity relationship, and such a relationship can only be con- 
structed within a body of conventions or within a form of life” (Pinch 1993, 
28-29). 

In this sense, debates about the quality of tests concerning technologies are 
analogous to those about the character of replications of experiments in Science, as 
Donald MacKenzie has stressed (MacKenzie 1989). Whether a replication is consid- 
ered sufficiently similar to the original experiment in order to count as valid proof 
often depends on the judgment of whether the experimental procedures have been 
executed properly and competently. Yet since many of such procedures involve 
tacit knowledge, the discussion about replication often leads to the problem of what 
Harry Collins (1983) calis “the experimenters’ regress” in Science: a seemingly end- 
less and detailed discussion of how the replication has been similar or dissimilar to 
the original experiment without resolving the academic issue at hand. Such similar- 
ity discussions also form the crux of disputes over testing. 
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MacKenzie reminds us, however, that most testing is “routinely accepted as 
fact” (1989, 415). He does so by referring to the work by Edward Constant (1980), 
who has shown that “traditions of testability” are at the heart of the routine prac- 
tices of communities of technological practitioners. With the origins of and resis- 
tance to the turbojet revolution in the aircraft industry in mind, Constant claimed 
the following: 

Particular techniques, technologies, and practices relating to testability ordinarily 
will be commensurate with, and to some degree will define, a specific commu- 
nity’s normal technology. The dominant system and conventional modes of 
testability usually are mutually reinforcing. To overthrow a conventional system 
frequently requires, in addition to the formulation of an alternative system and 
the definition of new performance parameters, the creation of new or much 
refined testing techniques. (Constant 1980, 22) 

Accordingly, methods of testing — or traditions of testability — and the dominant 
design characteristics of a particular technology are closely interconnected. 

How this works in practice has also been illustrated by Walter Vincenti in his 
book What Engineers Know and How They Know It (1990). In one chapter Vincenti 
examines the establishment of design requirements surrounding the flying qualities 
of American aircraft in the interwar years. Vincenti unravels the way in which a new 
tradition of testability had to be established before the aircraft industry was able to 
identify flying-quality specifications that both reflected subjective pilot opinión and 
were sufficiently precise to be useful in the process of design. 

The idea behind the specification of flying qualities was to produce aircraft 
“possessing adequate stability, responsiveness to the Controls, lacking in eccentrici- 
ties or sudden changes of behavior, and generally satisfactory to the pilot” (Consult- 
ing engineer Edward Warner, quoted in Vincenti 1990, 81). The initial problem, 
however, was that test pilots expressed their opinions of flying qualities in highly 
subjective and qualitative language. In their terms, the Controls would respond 
“very readily,” for instance, while the pressure exerted on the Controls was “normal,” 
and the longitudinal, lateral, and directional stability were considered “good” 
(Vincenti 1990, 64). This bore no relation, however, to the formal engineering 
criteria for describing particular types of aircraft stability or to what the engineers 
could measure in flight. 

Only after prolonged cióse cooperation between engineers, instrument makers, 
and pilots at a particular laboratory did the aircraft industry manage to develop 
a set of precise instruments and piloting techniques to make measurements in 
flight, as well as increased refinement of pilot opinión, so that the pilot “ knew what 
he liked” and could say so” (Vincenti 1990, 103; our emphasis). In this process, 
the use of “standardized terminology and definitions for research pilots and 
engineers and standard rating scales for quantification of pilot opinión” was para- 
mount (Vincenti 1990, 98). This shared vocabulary would not have been possible, 
however, without the iterative testing in which small groups of experienced pilots, 
instrument makers, and engineers — expert testers — collaborated for long periods 
of time. 
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As we argüe in the following sections, the new experience-oriented marketing 
trend in the automotive industry has elicited a new strand of “testability” in the area 
of car sound evaluation that resembles this tradition of the testability of aircraft 
flying qualities in several, yet not all, respects. Just like the aircraft engineers, the 
acoustical engineers who aimed at concrete sound-design specifications for cars 
had to construct a “realistic” setting for listening to car sounds in which the input of 
sound could be measured precisely. Similar to the aircraft engineers, moreover, the 
acoustics people needed to find ways to have the test subjects verbalize their evalu- 
ation of sound in a maximally sophisticated manner. As one of the researchers 
involved put it graphically, they had to find tools to “squeeze out” the test subject’s 
experience of sound as if this subject was an “orange” (NC, 4). To this end, the 
engineers collaborated with psychoacousticians. The test subjects themselves, how- 
ever, were less expert testers than the pilots. This, we will see, proved to be a crucial 
difference. 


Artificial Heads and SoundCars: 

CoNSTRUCTING ReALITY IN TEST SeTTINGS 


The OBELICS research project, launched in 1997 and funded by the European 
Union, lasted for three years. Its playful acronym stands for Objective Evaluation of 
Interior Car Sound. The project’s main goals were to understand the subjective 
evaluation of car sound ( ÍC the basis of sound language and sound perception”), to 
establish “methods and tools for an objective evaluation” of automotive sound, and 
to define “target sounds for different driving situations” (OBELICS 1999, 3, 6). Or, 
in the words of one of its participants: “What is a sportive car? How should a luxu- 
rious limousine sound?” (WK, 4). The institutes and companies collaborating in 
the project represented the automotive industry (Renault and Fiat), academic 
acoustics (Oldenburg University and Bochum University in Germany), and consul- 
tancy companies that specialized in acoustic measurement and analysis (HEAD 
acoustics) or in testing and simulation. 

Looking back on the project, several of its members were highly positive about 
what it had delivered in terms of the subjective and objective evaluation of sound 
and the tools for such evaluation. They were much less enthusiastic, however, about 
what it had meant for the definition of target sounds (NC, 7-10; KGi, 2-3; WK, 4; 
RSi, 9). One reason mentioned by the interviewees involved the difficulties of col- 
laborating with competing car-manufacturing companies — who were not fully 
forthcoming about their marketing data — in one project (NC, 10; KGi, 3; WK, 5; 
RSi, 15). One interviewee added that a European project like this simply could not 
do without the “rhetoric” of heading for a “concrete goal” that could convince 
“Brussels,” even though not everyone considered this aim “realistic” (NC, 7-8). 
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Other comments, however, hinted at the complexities of linking the results of car 
sound evaluation to the specifications for preferred car sound and the actual target 
sound design. We know from the story about the flying qualities of aircraft that 
connecting subjective evaluations to concrete design requirements is difficult, yet 
not impossible. What, then, happened in the case of automotive sound? 

Let us first focus on how the OBELICS members and the research projects 
preceding OBELICS dealt with projection and similarity issues concerning the 
car sounds to which the test subjects were asked to listen. Which testing sites and 
instruments did they consider necessary in order to make listening in the testing 
situation sufficiently similar to listening to car sounds in a real-life situation? And 
which tools, in their view, could measure in great detail what the test subjects 
listened to? 

In 1986 HEAD acoustics, the coordinating company behind OBELICS, was 
established. In the early 1980S, Klaus Genuit, the company’s founder, collaborated 
with Daimler-Benz on car sound analysis. Daimler had been confronted with cli- 
ents’ complaints about noise, but the company — relying on a conventional setup 
of a microphone, a sound-pressure-level meter, and a frequency analyzer — failed 
to find the origin of the problems (KGi, 1, KG2, 2). Genuit therefore developed 
an artificial head, or Kunstkopf > that could record and play back sound. His 
claim was that the recordings resembled human hearing much more closely. In 
contrast to traditional techniques, the artificial head took directional patterns 
and dynamic range into account. In addition, since the artificial head could both 
record and play sounds, it could be used for listening tests. Moreover, its software 
enabled modifications and analysis of the sounds. One could thus “manipúlate 
something and recognize [it] directly by listening : £ oh, this pattern, that is disturb- 
ing me. Not this, not this, but this 5 ” (KG2, 2). On the basis of this idea, Genuit 
established HEAD acoustics, which now works for the automotive industry world- 
wide, as well as for the information technology and domestic appliances industry 
(KG2, 6). 

Soon, however, the HEAD acoustics personnel decided that just sitting in a 
seat and listening to car sounds over loudspeakers or with binaural headphones 8 
was not realistic enough. First, they developed a SoundCar that mimicked a car in 
which the test subjects would not only hear car sounds — over headphones — but 
would also feel their seat and wheel vibrating at the very same time. For quite some 
time it had been known that noise, vibration, and harshness were interrelated in 
the evaluation of vehicle sound (which in fact had led this field of research to be 
labeled noise vibration harshness [NVH]). For this reason, the test site should be 
constructed accordingly so as to offer the test subjects an experience of the full 
range of effects (KGi, 4). Yet the disadvantage of the original SoundCar was that 
when sitting in the test car and opening the throttle, one “would indeed hear 
the right sounds, but . . . not feel the acceleration, yet stupidly stand still in the 
hall” (KGi, 4). As one interviewee added, “It was even a bit comical: we drove 
about 120 and looked out the window, and there were our colleagues standing next 
to us” (RH, 3). 
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This is why HEAD acoustics positioned the SoundCar on a fahrbar (moving 
belt) and also transformed it into a tool in which the test listeners could actually 
interact with the car and hear the car sounds accompanying their actions. For this 
simulation, HEAD acoustics developed its H3S software (HEAD 3-dimensional 
Simula tor), or Sound Simulation System. This system, as the firm claims in its 
advertising, “accurately and interactively simulates different driving situations. In a 
driving simulator [like a SoundCar] it creates not only engine, tire, wind, and other 
vehicle or background sounds but also structure-borne sound in the form of vibra- 
tion” ( HEADlines 2003, 5). Instead of being passive recipients of car sounds, the test 
subjects were now expected to interact with the setting. The headphones or speak- 
ers would produce new sounds only when the test driver acted, such as by shifting 
gears or putting on a turn signal. 

The third and most recent step in the process of creating more realistic test 
conditions has been the construction of a moving car sound simulator, the Sound 
Simulation Vehicle, which HEAD acoustics developed in collaboration with Ford’s 
Acoustics Center in Cologne. While actually driving a real car along the road, dif- 
ferent sounds could be channeled to the driver’s ears via headphones. The increas- 
ing similarity between the test site and actually driving a car created new problems 
for the evaluation of sound, however. First, a classic trade-off between the need for 
a “realistic environment” and the need for control popped up — between “realism 55 
and “rigor” in Robinson’s terms or between external and internal validity in today’s 
quantitative research vocabulary. In the Sound Simulation Vehicle, the researchers, 
unlike in the original SoundCar, are unable to control the car’s vibrations, and thus 
they do not have the option of modifying vibrations and experimenting with the 
relationships between the perception of vibration and sound (KGi, 5; RSi, 20). 
Moreover, when driving the Sound Simulation Vehicle, unlike in the lab car simu- 
lation, one cannot control test-driving conditions, such as specific weather or road 
conditions (SP, 9; RH, 14). 

In addition, the automotive engineers actually working with the mobile sound 
car, such as Ford, stumbled upon new “realisms” when doing the tests. In one such 
test the sound of a glamorous 500 PS Aston Martin was added to a relatively ordi- 
nary make, a Ford Focus. Test drivers opening the throttle of the Ford Focus would 
hear the sonic feedback of an Aston Martin: “And acoustically you get such feed- 
back that you think, c Wow, now we’re really driving, 5 but then not much really 
happens because the car altogether lacks performance. 55 This could be quite danger- 
ous in fact when trying to pass another car (RH, 7). In this way, the testing situation 
itself created shifting conceptions of what listening while driving was really about. 

If it was a challenge to create testing sites in which the listening situation resem- 
bled the real situation of listening to car sounds as closely as possible, it was quite 
another challenge for both automakers and testing consultancy companies to 
acquire knowledge about how test subjects thought and felt about the sounds to 
which they were exposed. The problem they faced was how to squeeze out listeners 5 
inner experiences and emotions, especially when they were untrained listeners 
rather than experts in acoustics. 
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Finding Words for Sound: 

The Expression of Sonic Perception 


For a long time, as Robinson’s remarks from the 1960S already suggest, it was not 
self-evident to use lay listeners when evaluating sound. In acoustics research in gen- 
eral, many early laboratory tests were executed by the acousticians themselves, and 
it was only in the course of time that the researchers started utilizing lay listeners. 
Moreover, the car industry continued to use expert listeners (and still does today) — 
often the acoustical engineers themselves — for the testing of particular sounds, 
such as car door sounds (BL, 9; SP, 9; KG2, 19). Using lay listeners became increas- 
ingly important, however, as the car manufacturers wanted to know how potential 
customers evaluated their car sounds. 

However, how did the automotive industry consider it possible to measure 
what these lay listeners experienced in terms of car sound? Initially, both the car 
industry itself and HEAD acoustics used so-called pair comparison tests and scale 
assessments. In pair comparison, or A/B tests, test subjects compare two sounds 
(sound A and sound B) that immediately follow each other, for instance in terms of 
agreeableness, and subsequently compare other pairs: A and C, B and C, and so on. 
The advantage of this method is that the test subjects do not need to remember the 
sounds for a prolonged time and can focus on very small differences. Yet, as a HEAD 
acoustics employee claimed, it is “artificial” since in everyday situations “one does 
not [constan tly] jump from one car into another” (NC, 4). Because the test subjects 
are “forced” to make a choice, they often report feeling insecure about whether they 
“have done the assignment well” (RH, 17), or they “just think of something else” 
and simply decide to not to participate in the test (RS, 8). 

The conventional alternative for the pair comparison test is the scale assess- 
ment test. Scale assessments allow test subjects to evalúate a particular phenomenon 
with the help of a scale. Whereas pair comparison is seen as relative testing, scale 
assessment is considered absolute testing. Again, test subjects find it rather hard to 
do such scale ratings in the case of sound (SP, 6). As one of us experienced herself 
at the lab of HEAD acoustics, it is difficult to rate the quality of a particular sound 
without the option of comparing it to other sounds. Furthermore, how does one 
choose between the third, fourth, or fifth box on a scale from one to seven? 

In psychoacoustics research, the semantic differential, a specific form of scale 
assessment, was introduced for sound perception in 1954. The original test focused 
on the meaning of underwater sounds to navy sonar operators. It displayed fifty 
bipolar attributes, such as “powerful-weak,” for the evaluation of complex auditory 
stimuli. Many of these attributes carne from descriptions of sonar sounds by jour- 
nalism students who had received specialized training in verbal expression and 
from “sonar recognition cues” used in teaching methods for sonar experts (Solomon 
1954, 17-18). Since the 1970S, these lists have been fine-tuned several times, for 
instance for vehicle acoustics. The members of OBELICS used a set of thirty-four 
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bipolar attributes, such as “boring-stimulating,” “cheap-expensive,” “loud-soft,” 
and “tiring-relaxing” (Hempel 2001, 170) (figure 4.2). 

One particular problem in tests based on the semantic differential, however, is 
this use of language. Since the project involved several European countries, the test 
subjects spoke different languages. However, as Robinson suggested as early as the 
1960S, words for sonic attributes that seemed to be translatable from one language 
to another might not have the same connotation in the culture using that language. 
To make the project truly European, one member of OBELICS developed a thesau- 
rus in which each set of expressions in Germán had its counterpart in English, 
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Figure 4.2 The semantic differential used in the OBELICS project: 

Thomas Hempel. Untersuchungen zur Korrelation auditiver und instrumenteller 
Messergebnisse für die Bewertung von Fahrzeuginnenraumgerauschen ais Grundlage eines 
Beitrags zur Klassifikation von Hórereignissen (Munich: Herbert Utz Verlag, 2001, 170). 

Courtesy Herbert Utz Verlag. 
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French, and Italian (WK, 4). Indeed, the thesaurus and cross-country comparison 
enabled the researchers to show that some car sound attributes had quite different 
meanings in different countries. The sonic understanding of sportiveness in Germany 
differed from that Italy, and loudness did not have the same significance in France 
that it did in Germany (NC, 3). This, Klaus Genuit suggested, was dependent on the 
traditions of car engines and gear systems — engine power, automatic shifting or 
not — in use in different countries (KG2, 16-18). Differences in technological devel- 
opment itself thus seemed to undermine the cross-country equivalence of rating 
words for sound in different languages. 

The OBELICS members developed a similar sense of variety when working 
with the semantic differential test in Germany. The words used for the bipolar attri- 
butes were unknown to many of the lay test subjects or were interpreted in a highly 
arbitrary manner: 

For example, what is rau [rough]? Do you know what sound rau is? What is a 
rough sound? Rau consists of the letters R and A, rrrrrrrrr, is this rough? Is aaaaaa 
rough? [I (KV) reply: “Rather R than A.”] Psychoacoustically, though, exactly the 
opposite is the case; A is very rough psychoacoustically because it comes with a 
high intonation. R comes with a deep intonation. It is not at all as rough, from a 
psychoacoustical angle, but in practice you are right, for most people think of 
Rauhigkeit as rrrrrrrrr. (KGi, 11) 

Moreover, during the semantic differential tests, the researchers noticed that 
the test subjects were inclined to tell the test leader much more than was actually 
asked for. One subject insisted on telling about her vacation experiences while 
traveling by airplane since the car sounds were “similar” and just as “pleasant.” 
Another left the test site crying “because a specific sound reminded her of an acci- 
dent” (NC, 4). In order to record these additional stories, the researchers intro- 
duced associated imagination of sound perception ( AISP), which was later reworked 
into expío rative vehicle evaluation (EVE). In the construction of these tests, Brigitte 
Schulte-Fortkamp, a psychoacoustics and sociology professor at Oldenburg 
University (later Berlín) played a significant role. 

In the AISP tests, the subjects, while positioned in a SoundCar, were asked to 
comment on the car sounds freely and mention every association that might pop up 
in their minds. Subsequently, they were interviewed about their comments, and all 
of them “found sentences to express themselves” (NC, 19; SP, 5). Through such 
tests the researchers claimed to reach “another dimensión in the description of 
noises,” one much closer to the test subjects (NC, 3-6; AF, 3; RS2, 4). Instead of 
mere evaluations of sound, one now had a way to understand their origin — the 
why behind the what. Each personas assessment of a particular sound, they said, was 
based on gesammelte Erfahrungen , a series of earlier experiences and memories and 
expectations built upon them. Two persons, for instance, might both consider a car 
sound loud. Yet, while the first one would love it, claiming to be a “fan” of the car 
tested, the second considered it “too loud . . . for I imagine myself having to drive 
this car for five hours, which I do regularly, Paris-Aachen, for instance” (NC, 6). 
The explorative vehicle evaluation tests, created after the completion of the OBELICS 
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project, were tailored to elicit such stories in situations in which test subjects would 
actually drive a car while listening. 

The usefulness of the AISP and EVE tests for sound design was highly contested 
within the automotive engineering and acoustics community, however, and not 
only because the tests were time consuming and expensive (WK, 13, RS2, 3). 
Scientists and engineers from Oldenburg University considered the tests too vague 
and doubted their usefulness (NC, 12-13; KGi, 14). Many acousticians and acousti- 
cal engineers had hoped to set up a sort of universal sound quality Índex, in which 
a series of numbers would indícate which design specifications would lead to sounds 
of high quality (RS, 4-5). They did not see how the outcomes of the AISP tests could 
bring the sound quality índex any closer (KGi, 14-13). An Opel engineer explained 
that he actually preferred a sound quality system that would get rid of differences in 
taste (BL, 2). 

Many of the OBELICS personnel involved in subjective testing considered such 
a sound quality Índex a bridge too far, at least for automotive sound. In their view, 
car sound quality depends simply too much on context (TH, 3; WK, 10; RSi, 6; 
SP, 14), and ideas about the best sound quality will probably evolve over time (NC, 10). 
They were similarly skeptical of the possibility of designing a few widely loved car 
sounds, the “right” sounds for many. One of them even considered it “completely 
utopian,” especially for interior sounds. A target sound will work only under strictly 
confined conditions, say, if “I drive a 4-cylinder, midsize, 30 km/h, I (being under 
twenty-five years of age) would prefer something like a sporty car, without exhaust 
or anything” (NC, 8). In fact, he and others claimed, one can define preferred 
sounds only for very specific groups of consumers. This implies that one should 
also invite narrowly defined target groups for the testing procedures (NC, 8; RSi, 10; 
SP, 11). Moreover, particular expectations will affect the evaluation of sound: “When 
I drive a Mercedes-S, I expect something different from what a Porsche driver 
expects from his Porsche. Were I to interbreed both cars, both drivers would feel 
unhappy. So I cannot simply say This sound is okay’ ” (KGi, 7). HEAD acoustics’ 
president Klaus Genuit therefore felt that AISP and similar tests can be commer- 
cially exploited only for quite specific versions of target sounds or “benchmarking” 
(KGi, 3). 

Although Genuit expressed disappointment over the engineers’ inclination to 
consider everything measurable (KGi, 14), he is an engineer himself — as are many 
OBELICS members, even those involved in subjective testing. This is why the line 
between proponents and opponents of AISP was never simply one between engi- 
neers and nonengineers. Rather, there was, by and large, a distinction between 
nonengineers and engineers involved in Consulting on sound design on the one 
hand and those involved in implementing sound design on the other (WK, 10-11). 
Given their reliance upon numbers responsible for implementing new designs, the 
“implementation” engineers considered AISP not easily “management-compatible” 
(TH, 5 ). 

Some makes, such as the Harley Davidson motorcycle and the Porsche sports 
car (RS, 12, 15; PE, 3) already had exterior target sounds connected to their image. 
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Yet in these cases, several interviewees claimed, engine sounds originated first, 
while the idea to treasure these as target sounds carne only much later. Such “acous- 
tical fingerprints” might work (RH, 14). Porsche, for instance, had worked hard to 
keep its engine sound stable even when it shifted from its boxer air-cooled engine 
to the 6-cylinder, water-cooled system (KGi, 9). Moreover, representatives from 
the luxury car manufacturers on the ISO committee for the standardization of vehi- 
cle noise measurement (Technical Committee 43, Working Group 42) preferred 
taking tire noise into account when measuring car sound because this would give 
them more leeway in dealing with the engine sound (US, 4-5; FR, 31-40). If the 
measurement procedure had focused solely on engine noise, manufacturers like 
Porsche would have been forced to change the characteristic sound of their car 
engines. 

Yet designing a target sound from scratch was widely considered to be a com- 
pletely different challenge. What’s more, who should have the final word in the 
actual design? A Ford employee explained that one simply could not leave the 
conception of target sounds in the hands of the test subjects only: 

When you ask customers how a sound should be, you will get as many different 
answers as the number of customers you ask. This means that in part you cannot 
have the customer decide; after all, you need to hold on to the characteristic Ford 
sound. As a distinctive sound it also has to be embedded in a whole range of 
different things. The sound should fit in with everything we want to convey 
through our car philosophy. (RH, 8) 

In the ideal world of this Ford sound engineer, future Ford customers would be able 
to upload a series of car sounds (for the turn signáis, seat-belt warning, windshield 
wiper, and so on) of their choice, just like uploading a ringtone for a cell phone. Yet, 
and this was a crucial twist, this option should be available to the customers only 
after the Ford sound engineers had created a full “sound composition” in which all 
of the sounds would both be typically Ford and go together extremely well (RH, 8; 
12, PE, 4). Moreover, design needed to be accompanied by marketing techniques. 
When Ford had designed a new fancy turn-signal sound (“pock-pock-pock”) for its 
Ford Focus, many test subjects and automotive journalists initially rejected it 
because of their nostalgia for the venerable “click-clack” sound originally linked to 
the relay. Only after the Ford Focus had been advertised as a first-rate modern car 
and a story had been linked to its futuristic turn-signal sound did the new sound 
come to be accepted (RH, 9-10). 

Yet even aside from the sonic cultural conventions the automakers had to deal 
with, it is clear that in the debate over sound quality, acoustical engineers desiring 
an Índex and researchers heralding the significance of subjective testing by lay 
listeners were pitted against each other. One way to understand the tensions between 
the two is suggested by the work of linguistic anthropologist Tom Porcello (2004). 
Porcello analyzed a conversation between an experienced, professional producer 
and a student, a novice, in a recording studio by focusing on their talk about sound 
during a recording session. While the professional was used to talking in terms of 
musical instruments and technologies such as microphones to explain the kind 
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of sound he wanted, the student referred to particular bands and songs to 
express the sounds he preferred. This led to profound misunderstandings between 
the two despite the producer's efforts to bridge the gap by using metaphors and 
mimicry. 

These misunderstandings seem to be similar to those between the index-loving 
acoustical engineers and the protagonists of subjective lay testing, with the test sub- 
jects in the role of novices and the acoustical engineers in the role of the producers. 
The discursive clash that Porcello analyzed also reminds us of the misunderstand- 
ings that originally kept apart the aircraft engineers and the pilots in the flying qual- 
ities case described by Walter Vincenti. In Vincenti's example, this was solved by 
having the pilots and the engineers work closely together — even in the aircraft 
itself — to find connections between the pilots' ways of expressing themselves and 
the engineers' technology-related thinking. In the car sound evaluations, however, 
there was little sustained collaborative effort between the engineers and the lay lis- 
teners. We explain this by referring to the marked differences in status between test 
pilots and test car drivers. While the first are members of a profession with, in the 
terms of sociologist Andrew Abbott (1988), an “exclusive jurisdiction” concerning 
the flying of aircraft, everyday car drivers cannot claim such a special position. The 
officially acknowledged exclusiveness of the pilots' skills and certainly test pilots' 
skills fueled the interest of the engineers in a sustained collaboration with these 
pilots (Vincenti 1990, 70-71). 

We have already explained why the acoustic consultan ts at OBELICS considered 
it important to work with lay listeners. Preferably, these lay listeners should be highly 
specific groups of car consumers tailored to the type of car tested. As one engineer 
said, it was quite absurd to have the sound of a hundred thousand Euro cars assessed 
by students — not exactly the usual candidates for driving such cars (BL, 3). At the 
same time, however, many of the interviewees stressed that expert listeners noticed 
far more than lay listeners (RS2, 5; WK, 14; BL, 3; KG2, 13). In contrast to lay listeners, 
experts were able to determine within the laboratory which sounds would sound 
natural outside the lab's walls (BL, 6-7). Yet outside the lab, the interviewees pre- 
ferred lay listeners. Why had this come to have such crucial significance? 


Marketing Sound in the Experience 

SOCIETY 


In the 1960S and 1970S, national governments and European bodies imposed increas- 
ingly stringent limits on the máximum noise emission of cars (Sandberg 2001; DC, 1-2). 
Between 1970 and 2000 the European máximum for passenger cars decreased 
from 86 to 74 dB (EL and VM, 6). As a result, automobile makers were forced to 
further limit car noise. Engineers had their hands full when it carne to reducing the 
low-frequency sound in cars (Stockfelt 1994). There were many such sounds: the 
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rustle of the wind, the engine vibrations, the sound of tires touching the road sur- 
face, the hydraulic system of power steering, and the whining of the fuel pump. 
Acoustical engineers tried to tackle these by, for instance, applying acoustic glass or 
introducing an innovative design for car tires (Dittrich 2001; Steketee 2006; 
Kouwenhoven 2002). Time and again, however, they would find that removing one 
noise rendered another one audible (Schick 1994; Freimann 1993; NC, 20). So, 
even though car manufacturers had been interested in interior car sound since the 
early 1920S, it was the obligatory reduction of exterior noise that brought formerly 
masked interior sounds into the limelight again after the 1970S (WK 8; BL, 11; RS, 11; 
SP, 13). 

Both the car industry and cultural sociologists, however, tend to explain the 
increasing focus on sound design by referring to the rise of experience society- 
related issues. Germán sociologist Gerhard Schulze (1992) argües that sensory 
experience plays a large role in the selling of consumer goods today. This emphasis 
has emerged because many producís have been perfected to such a degree that dif- 
ferences in technical specifications between brands have decreased over time. Since 
the 1980S most products now simply work. Moreover, most consumers today choose 
from an enormous array of consumer goods on account of the booming postwar 
economy. The many options available, however, also give rise to uncertainty. To 
compénsate for the absence of differences in technical quality and make selection 
easier for consumers, products are increasingly sold by cashing in on the emotional 
meaning and inner experience products evoke in buyers. 

In the marketing of new products, therefore, sensory experience has become 
crucial. How appliances feel, smell, or sound and how that fits the buyer’s identity 
have thus become as relevant as how they look (Marks 2002, 114; Wenzel 2004). All 
this has led to a strong “aestheticization” of everyday life, described by Schulze as 
the emergence of the Erlebnisgesellschaft , or experience society, as explained earlier. 
In such a society, experience valué trumps functional valué both in selling strategies 
and in the motivation of consumers (Schulze 1992, 59). 

Many of our interviewees within or related to the automotive industry 
presented a similar analysis of today’ s society (KGi, 9; RH, 20-21; BL, 10-11; AF, 4; 
RS, 8). Now that there are many wealthy people, one interviewee claimed, their 
concern for quality of life has come into play. In addition, since they no longer 
have to worry about whether their cars actually run, they may start complaining 
about its sound (NC, 11). What’s more, the automotive industry considers sound 
more easily observable by consumers than other aspects of automotive quality, 
such as safety (Zeitler and Zeller 2006, 1). Dependent on customers’ specific prefer- 
ences, cars should have “decent,” “luxurious,” “dynamic,” or “sporty” sounds 
(Bernhard 2002). Renault, for one, made its Clio 3 “silent” and gave its Megan Sport 
a “dynamical sound” (EL and VM, 2). 

Although there may not be sufficient evidence to warrant an analysis of 
contemporary culture in terms of the experience society, what is relevant to our 
understanding of the tensions between subjective car sound evaluation and the 
search for objective car sound specifications is that the world of marketing believes 
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in such a society and in sound as a “marketable vehicle attribute” (Repik 2003, 5). 
Designers are now obliged to “ascertain what emotional valúes they want the 
consumer to attach to the product. They then develop forms which instígate the 
associations to, hopefully, incúlcate those feelings” (Guy Julier, quoted in Lury 
2004, 87). Or, as brand experts Hajo Riesenbeck and Jesko Perry explain, fully in 
line with Schulze’s analysis, “The price of a new car is so high that buyers will have 
the basics covered anyway. . . . This makes it imperative for automotive advertising 
to appeal to the heart and soul rather than reason (Riesenbeck and Perry 2009, 29). 
This has grown even more important after an increasing number of different car 
makes and types began to be put together on the basis of the exact same set of car 
parts (Gottfredson et al. 2001). 

This shift in marketing strategies needs qualification to prevent misunderstand- 
ing. We do not claim that until recently marketing people have had no eye for 
things such as the image of a product or the lifestyle to be associated with the prod- 
uct. On the contrary, Wolfgang Kónig has convincingly traced attempts to create a 
positive product image by means of symbolic messages in advertising to the 
years prior to World War I (Konig 2000, 405). Ñor do we claim that companies did 
not have their producís tested by consumers: In Western Europe, firms started 
establishing testing labs and design departments in the 1950S (Oldenziel, de la 
Bruhéze, and de Wit 2005, 118). Yet, while manufacturers had long been interested 
in what the consumer did with the product-, since the 1970S they have become obsessed 
by what the product does to the consumer. What manufacturers now want to 
have tested is which emotions the product stimulates in test subjects, the future 
consumers. 

It is in this context that an extensive research industry has evolved that concen- 
trates exclusively on developing testing methods for tracing these feelings. One 
component of this new research industry is HEAD acoustics, providing the auto- 
motive industry (and increasingly the domestic appliances and personal Computer 
industry) with testing methods and actual testing of sound. In turn, companies 
such as HEAD acoustics collaborate with universities, whose staff members equally 
address the changes associated with today’s experience society (Schulte-Fortkamp, 
Genuit, and Fiebig 2007, 12). The notion that sound “is well known to enhance 
or detract from our pleasure in possessing or using a product” (Boulandet et al. 
2008, 1) has thus been reinforced by an emerging and growing network of manufac- 
turers, designers, testing companies, marketers, and academics who reciprocally 
spread the word of sensorial branding and design. 

In light of the trends in marketing just mentioned, it is understandable that 
HEAD acoustics endeavored to develop qualitative tests that would enable the full 
expression of the rich variety of test subjects 5 emotions with regard to car sounds. 
Because these tests had to reflect the evaluation of untrained listeners, that is, 
the everyday consumer of cars, HEAD acoustics increasingly selected lay listeners. 
It is ironic that while the automotive industry considered these consumers highly 
sensitive to sound, it took new ways of testing to have these consumers properly 
express their assessments of particular sounds. The experts 5 idiom of sounds did 
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not speak to these lay listeners. The new tests in the OBELICS project did not 
straightforwardly lead to the specification and design of target sounds. Ñor did it 
generate a sound quality índex, however. Why this happened becomes clear when 
we compare it with the project on flying qualities specifications as described by 
Vincenti (1990). Unlike this project, OBELICS encompassed no cióse and pro- 
longed collaboration between engineers and testers. Unlike the pilots, the car 
drivers were not members of an acknowledged profession with whom the engineers 
had to build a long-term relationship. 


CONCLUSIONS 


In the 1980S and 1990S the automotive industry displayed a widespread belief in the 
rise of the new experience society. This resulted in new marketing strategies that 
highlighted the significance of car sounds. The driver’s emotional experience of 
quality car sound, so it was believed, would help to sell a car. Moreover, specific 
target sounds would enable new connections between a car make and particular 
groups of consumers. 

This belief led to an emerging tradition of testability in the European automo- 
tive industry. The crux of this testability, in which the Germán company HEAD 
acoustics played the lead role, is subjective testing, which enables free, associative 
verbalization of car sound perception. Moreover, testing car sounds while driving 
(in a mobile SoundCar) became the heart of projection: the idea that the testing was 
“similar in crucial respects” to actual driving. 

The OBELICS project, in which representatives from the automotive industry, 
acoustics departments at universities, and testing consultancy companies collec- 
tively aimed at linking subjective testing to objective target sounds, did not imme- 
diately lead to the definition of such target sounds, however. In contrast to a similar 
research project on the specification of flying qualities for airplanes, HEAD acous- 
tics used lay rather than expert testers. Although this was fully in line with the mar- 
keting strategies of the automotive industry, it did not easily facilítate fruitful 
collaboration between automotive engineers and sound designers. Whereas test 
pilots and aircraft engineers had found a common language for the design specifi- 
cation of aircraft flying qualities through long-term, cióse, and interprofessional 
collaboration, the lay testers expressed their sonic preferences in the absence of the 
designers. In the designers 5 perspective, the lay testers behaved like the novice in 
Tom Porcello’s study of studio engineering: The words they employed for sound 
were incomprehensible to the sound designer. 

If the marketers are right in suggesting that the testing of consumer goods will 
increasingly concéntrate on the feelings that producís elicit in their users, scholars 
in STS and sound studies should take the specific difficulties of testing the sensory 
qualities of producís into account when studying the coproduction of technology 
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by consumers and producers. Michel Callón, when reflecting on the new “economy 
of qualities,” seems to be optimistic about this coproduction. In his view, the 
increasing focus on the construction of qualities in design and the organization of 
“real-life experiments on preferences” related to it “facilítate the intensification of 
collaboration between supply and demand in a way that enables consumers to par- 
ticipate actively in the qualification of producís.” This process implies consumers 
who are “capable of perceiving differences and grading them” (Callón, Méadel, 
and Rabeharisoa 2002, 212; our emphasis). Callón presents particular forms of 
Information and Communication Technology, which constantly tracks consumers’ 
actions as preferences, as his ideal example and also notes that consumers are 
supported in their evaluations by suppliers and their intermediarles. He assumes, 
however, that consumers in fact are capable of articulating what they want. 

As our argument underscores, a similar interpretaron of the new sensory- 
oriented testing traditions is unwarranted because, in the settings involved, con- 
sumers cannot directly vote for their preferences. Speech about sound (Krebs and 
Rice, this volume) is so invested with cultural habits, training, and differences in 
cultural capital that a good match between testing and designing car sound seems 
to still be in the distant future. We need to acquire more knowledge about the new 
traditions of testability related to the new economy of qualities and sensory- 
oriented marketing. In this respect sound sounds like the perfect case to start with 
indeed. 


NOTES 


1 “Let your car do the talking” (magazine ad, Chrysler Voyager, published between 
2002 and 2007); “U rijdt geen diesel, u rijdt Jaguar” (brochure, Jaguar, Netherlands, 2006); 
Zelfs de stilte is standaard , commercial, Toyota Avensis, videotape (Talmon: AV 
Communicatie, 1999); “Een mens krijgt 469.082 geluidsprikkels per minuut,” Elsevier 
(Oct. 23, 1999): 12-14; Business Travel , commercial, Mercedes-Benz, videotape (Stuttgart: 
DaimlerChrysler AG, 1990); Blind Brothers Three , commercial, Volkswagen Passat, 
videotape (Almere: TeamPlayers, 1999); a De nieuwe Ford Focus: Voel de verfijning,” 
Dagblad de Limburger (Jan. 24, 2005): B11. All translations are by the authors or language 
editor, Ton Brouwers. We thank him, Klaus Genuit, and Trevor Pinch for their 
comments. 

2 We thank Kristin Vetter and Fleur Fragola for the interviews they undertook. 

3 “Hoover Steel Balls” advertisement, Automotive Industries (Feb. 12, 1920): 61. 

4 “Hyatt Quiet Roller Bearings” advertisement, Automotive Industries (Jan. 30, 

1932): 5- 

5 “Western Felt Works” advertisement, Automotive Industries (Jan. 2, 1932): 64. 

6 “Bodies by Budd” advertisement, Automotive Industries (Jan. 2, 1932): 45. 

7 Archives National Physical Laboratory, Papers of D. W. Robinson, “Recent Advances 
in the Subjective Measurement of Noise,” [1963]: 157-77, at 164, 163, and 161, respectively. 

8 Binaural headphones replay recordings made with the help of two microphones, one 
for each ear, thus mimicking human hearing. 
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Table 4.1 OverView of Interviews 

Ñame of Background, Affiliation at Time of Date and Place 

Interviewee Interview y Former Affiliations 


Nicolás Chouard 
(NC) 

David Delcampe 
(DD) 


acoustic engineer, former 
Renault employee, formerly 
involved in OBELICS 
European Commission, DG 
Environment 


June 7, 2004 
Aachen, Germany 

October 23, 2007 
Brussels, Belgium 


Interviewer 

(Original 

Language) 

Kristin Vetter 
(KV) 

(Germán) 
Fleur Fragola 
(FF) 

(French) 
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Table4.i Continued 


Ñame of 

Background, Affíliation at Time of Date and Place 

Interview er 

Interviewee 

Interview , Former Affiliations 


(Original 

Language) 

Peter Ehinger (PE) 

head of acoustics department, 

December 10, 2008 

Eefje Cleophas 


Porsche, former member of ISO 

Weissach, Germany 

(EC) 


TC 43, WG 42 


(English) 

André Fiebig (AF) 

employee, HEAD acoustics 

July 31, 2009 

EC 



Kohlscheid, Germany 

(English) 

Klaus Genuit 

engineer, president, HEAD 

June 3, 2004 (KV) 

KV & EC 

(KG) 

acoustics, formerly involved in 

Aachen, Germany 

(Germán & 


OBELICS 

November 18 (EC) 
Kohlscheid, Germany 

English) 

Ralf Heinrichs 

engineer, Ford employee 

June 8, 2004 

KV 

(RH) 


Cologne, Germany 

(Germán) 

Thomas Hempel 

engineer, formerly involved in 

May 26, 2004 

KV 

(TH) 

OBELICS 

Munich, Germany 

(Germán) 

Ian Knowles (IK) 

European Commission, DG 

October 9, 2007 

FF 


Enterprise 

Brussels, Belgium 

(English) 

Winfried Krebber 

engineer, formerly involved in 

May 6, 2004 

KV 

(WK) 

OBELICS 

Aachen, Germany 

(Germán) 

Bernhard Lange 

engineer, Opel employee 

June 23, 2004 

KV 

(BL) 


Rüsselsheim, Germany 

(Germán) 

Eric Landel 

Renault employees 

October 16, 2007 

FF 

(EL)Virginie 
Maillard (VM) 


Guyancourt, France 

(French) 

Stephan Paul (SP) 

PhD student at HEAD acoustics 

June 17, 2004 

KV 



Bonn, Germany 

(Germán) 

Foort de Roo (FR) 

acoustic engineer, former 

June 26, 2008 

EC 


member ISO TC 43, WG 42 

Delft, the Netherlands 

(Dutch) 

Gerhard Thoma 

members of acoustics 

November 20, 2008 

EC 

(GT), Manuel 
Reichle (MR), 
Alfred Zeitler (AZ), 

department BMW 

Munich, Germany 

(English) 

Ulf Sandberg (US) 

engineer, member of Swedish 

October 1, 2007 

FF 


National Road and Transport 
Research Institute (VTI), chair 
of several committees of ISO 
and CEN 

Brussels, Belgium 

(English) 

Wolfgang 

European Commission, DG 

October 18, 2007 

FF 

Schneider (WS) 

Enterprise 

Brussels, Belgium 

(English) 

Roland Sottek 

engineer formerly involved in 

May 14, 2004 

KV 

(RS) 

OBELICS 

Aachen, Germany 

(Germán) 

Rob Vermeulen 

director of Total Identity 

August 7, 2008 

EC 

(RV) 

(marketing company) 

Landgraaf, the 
Netherlands 

(Dutch) 

Charles Zhang 

acoustic engineer, Renault 

November 13, 2008 

EC 

(CZ) 

employee 

Guyancourt, France 

(English) 






SECTION II 

ST AGING 
SOUND FOR 
SCIENCE AND 
ART: THE FIELD 




CHAPTER 5 


SOUND STERILE: 

MAKING 

SCIENTIFIC FIELD 
RECORDINGS IN 
ORNITHOLOGY 


JOERI BRUYNINCKX 1 


Introduction 


It is a mad, reckless song-fantasia, an outbreak of pent-up, irrepressible glee. He 
begins bravely enough with a number of well-sustained tones, but presently he 
accelerates his time, loses track of his motive, and goes to pieces in a burst of 
musical scintillations. (Mathews 1904, 49) 



Figure 5.1. Transcript of bobolink song. From Ferdinand S. Mathews, Field Book ofWild 
Birds and TheirMusic: A Description of the Character and Music ofBirds 

(New York: Putnam, 1904, 51). 
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When writing this in 1904, American naturalist and composer Ferdinand S. Mathews 
was not reviewing one of his fellow artists’ latest compositions. In fact, he was 
describing a motif of the “musical fireworks” produced by a bobolink, an American 
songbird. In his elabórate Field Book ofWild Birds and Their Music (1904), Mathews 
had made impressively detailed transcripts of the songs of almost 130 North 
American wild bird species. He had transcribed them all by ear and in musical nota- 
tion because “only the musical staff expresses [them] accurately” (Mathews 1904, 
xxi). With that claim, however, Mathews found himself caught up in an emerging 
controversy. 

Animal sounds had been recorded in musical notation for centuries — the very 
terminology of song and singing testifies to their original appreciation in musical 
terms. In the first decades of the twentieth century, however, a range of other tech- 
niques was developed to record such sounds “after nature” (Ranft 2001). Some 
naturalists found bird sound better represented by nonsense syllables or phonetic 
vowels; others devised their own graphic notation. In unisón, these recordists 
rejected musical notes as inaccurate and unscientific descriptions. They argued that 
musical scores represented the pitch and harmony of a song but that other features 
were more important. Such features, like timbre and sound quality, were rendered 
more clearly by their respective systems. Mathews in turn sneered at his skeptics 
and claimed that they were just not knowledgeable about musical theory (Mathews 
1904, i). 

However, in a twist of irony, the song that inspired Mathews to write such 
powerful prose appeared unsusceptible to his favored approach. Acknowledging 
that “the difficulty in either describing or putting upon paper such music is insur- 
mountable,” he quoted a fellow naturalist who had failed to write down this song as 
early as 1892: “We must wait for some interpreter with the sound-catching skill of a 
‘Blind Tom’ and the phonograph combined, before we may hope to fasten the kinks 
and twists of this live music-box” (Mathews 1904, 49). 2 Recording and representing 
the sounds of nature clearly carne with its own set of problems — problems that 
mechanical recording, or so these naturalists hoped, could help to solve. 3 Earlier, in 
1898, the audience of the American Ornithologists’ Union congress had been played 
a sound recording of a captive bird, indeed suggesting “great possibilities to be 
looked for in the future” (Sage 1899, 53). It was not until three decades later, how- 
ever, that university ornithologists and the recording industries in Britain and the 
United States began organizing frequent bird-sound recording expeditions. From 
the late 1920S onward, electrical recording devices began to feature prominently as 
tools for the study of birds’ singing behavior in the field. 

With this shift from recording by ear to a mechanical recording with micro- 
phone, the culture of scientific recording also changed. The emergence of electrical 
recording not only presented ornithologists with different material and technical 
conditions of their work. It also forced them to rearticulate the role of recording in 
their scientific studies, as well as the practical conventions guiding it: How was one 
to deal with sound recording as a scientific practice? In this chapter I connect these 
material conditions and scientific conventions by asking how the mechanization of 
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sound recording has altered the ways ornithologists dealt with natural sound as an 
object of scientific study. 

I show that electrical recording confronted the ornithologists with a different 
way of listening in the field — and later in the lab. Recording through mechanical 
“ears” rather than human ears gave ornithologists certain advantages, such as replay 
or “faithful reproduction,” but they also carne to experience the world as noisier, 
more contingent, and more uncontrollable than they had before. Electrical ampli- 
fication now forced ornithologists to decide what sounds to record, what to discard 
as noise, and how to eventually represent those sounds. I show that although these 
recordings were made outdoors, some scientists tweaked them such that ultimately 
they transformed a “messy” field sound into “cleaned-up” imagery. Through 
specific recording equipment and techniques such as the parabolic microphone 
and the spectrographic visualization of sound as data, ornithologists gained a par- 
ticular form of control over field sound. As a result, the distinction between field 
and laboratory practices becomes blurred. This, I suggest, results in a consistent 
sonic sterilization of field sound, the intentional erasure of noises. 

Thus, in investigating sound recording as a scientific practice, this chapter 
traces dynamics between field and laboratory, between Science and music, between 
wanted sound and unwanted noise, and between the audible and the visual. I do so 
by looking into the actual recording work of a small number of recordists and orni- 
thologists in the United States and the United Kingdom between the 1910S and the 
early 1960S. Empirically, this study draws on a longitudinal reading of five leading 
British and American ornithological journals, The Condor , The Auk , The Wilson 
Bulletin , Journal of Field Ornithology , and British Birds , combined with archival 
work at the British Library and a selection of memoirs by the ornithologists and 
recordists under study. The goal is decidedly not to survey a long history of wildlife 
sound recording. Instead, I single out two periods, in the 1930S and the 1950S, in 
which the entanglement of the production of sound recordings and scientific 
culture and conventions becomes especially clear. Additionally, the focus of this 
chapter on the techniques of nature recording and the involvement of the recording 
industry and academia presents a relevant contribution to the recording histories 
collected in this handbook (Helmreich, Kursell, Sterne, Taylor, this volume). 


The Sound of Field Science 


This chapter signáis a history of sound recording in the field , outside the typical 
concert hall or recording studio. This outdoor focus is not uncommon in Science 
studies, where the field site is considered not just a random place. Crucially, like the 
laboratory, it is also found to be a site of scientific labor, as well as a distinct cultural 
category (Kuklick and Kohler 1996; Burkhardt Jr. 1999). In scientific work, the 
field site and the laboratory are recognized as two ways of legitimizing knowledge, 
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each associated with different occupants, vocabularies, materials, and virtues 
(Gieryn 2006). Both sites have been considered opposites that mutually defined 
each other even though in practice they have sometimes been hard to distinguish 
between (Kohler 2002). 

For natural scientists, the field site has remained a place where they encounter 
their object of study in its natural and unspoiled State. The ornithologists quoted in 
this chapter all drew heavily on their recordings in the field. Being there granted 
them the unique authority of personal witnessing. Yet phenomena in the field are 
often also considered particular, unrepeatable, and uncontrollable (Collins and 
Pinch 1993). This makes scientific work in the field a complex task. Moreover, public 
field sites are almost never exclusively scientific domains: Other actors, processes, 
and materials are after all difficult to fence out (Kuklick and Kohler 1996). 
Consequently, the field is associated not just with immediate experience but also 
with distraction, lack of control, and imprecisión. 

Modern laboratories, on the other hand, are claimed to grant scientists control 
over their objects of study (Latour and Woolgar 1986). Laboratories create their 
objects of investigation by detaching natural phenomena from their original 
context and environment. It is this isolation of the object against the standardized 
and sterile background of the laboratory that is said to yield experimental results 
that are “objective” and <c universally true” (Kohler 2002). At the same time, 
the lab enables scientists to measure, manipúlate, and control natural objects 
(Knorr-Cetina 1999). 

Recent studies have shown this issue of control and isolation to be in place also 
for laboratory sound. Cyrus Mody (2005), for instance, describes how sound can be 
perceived as a sonic contaminant that potentially disturbs scientists’ experimental 
setups. Droning infrastructure or outside trafile noises challenge the conviction 
that laboratories not only look and perform alike but may also sound alike. Moreover, 
this perception of acoustic pollution is by no means exclusive to high-tech, modern- 
day laboratories. Henning Schmidgen (2003) shows that even in the nineteenth 
century, psychological experiments were found to be heavily disturbed by noise 
from outside and even inside the isolated rooms of subject and experimenter. 
A lack of what could be termed sonic sterility in this laboratory context apparently 
challenges what scientists regard as faithful, reliable, and objective practice. 

Laboratories, in contrast to field sites, are places where, also sonically, control 
is desired. This study expands this work on sound in scientific practices by focusing 
on a “nonlab” environment. It investigates the ways in which ornithologists and 
recordists have dealt with a working environment where isolation of and control 
over natural acoustic phenomena are particularly hard to achieve. Work by Thomas 
Gieryn (2006) can suitably serve to conceptualize the relation between laboratory 
and field site. As Gieryn demonstrates, this relation is a dynamic one. Urban soci- 
ologists would in their papers continuously oscillate between such categories of lab 
and field. By rhetorically constructing their work as an experimental laboratory 
practice, as well as an observational field practice, these sociologists have managed 
to relate to the epistemic authority of both places. Here, laboratory and field feature 
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not so much as physical workplaces but as textual constructions or frames of mind 
that “legitimize” these scientists’ results. 

Gieryn identifies three dimensions to schematize how researchers shuttle back 
and forth between their constructions of the city as a lab or field. First, at the field 
site, objects ar efound naturally, while in the laboratory they are made , crafted first 
into suitable objects of research. Furthermore, at the field site, objects of research 
connect intimately with the place of research, while the laboratory practice inten- 
tionally constructs environments and objects that are both generic and universal. 
Finally, field and lab position analysts in different ways vis-á-vis their objects of 
study. At the field site, researchers get immersed — their research object is every- 
where around — while the laboratory creates a distance between researcher and 
object: “ [ A]long with the white coat comes a detached objective view from nowhere. 
Elements . . . are manipulated in a passionless, mechanical, and antiseptic way” 
(Gieryn 2006, 11). 

Drawing upon these dimensions (ready-to-use, specific context, and immer- 
sion versus craft-to-use, generic context, and detachment), I argüe that although 
nature recordists necessarily recorded their specimens in the field, their recordings 
were also made to fit Gieryn’s category of the lab. Using mechanical recording tech- 
niques, these recordists crafted the sounds they collected in the field in ways that 
ultimately problematizes the distinction between lab and field. 


Naturalists Listening: Graphic 

ScORES OF BlRDSONG 


Since the end of the nineteenth century, nonprofessional naturalists 4 and university 
biologists have increasingly turned from museum collections to field sites and zoo- 
logical laboratories to systematically observe living animáis’ behavior (Burkhardt Jr. 
2005). 5 Along with the rise of ethology and behavioral biology in the first decades 
of the twentieth century, the field gained special importance for ornithologists 
interested in the function of birdsong. Which setting, however, granted the best 
scientific insights into the animáis’ natural behavior remained an issue of conten- 
tion (Burkhardt Jr. 2005). In the 1920S, for instance, the young recordist Ludwig 
Koch, whom I discuss in more detail later, was told by his mentor, Professor Oscar 
Heinroth — one of ethology’s founding fathers — that his attempts at recording birds 
in the open were “merely a circus performance” (Koch 1955). After all, Heinroth 
argued, birds behave the same in captivity as in natural surroundings. However, 
even then, naturalists appealed to the field site to make their observations of 
songbirds. 

In doing so, field ornithologists in the first decades of the twentieth century 
devised a whole array of recording techniques to replace the musical notation 
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technique that was common then. Simpler graphic and phonetic techniques were 
thought to be more practical for the demanding job of field recording and identifi- 
cation. In addition, American naturalist Saunders (1916) claimed, existing methods 
of recording birdsong gave no satisfactory impression of the actual song. Musical 
notation only made a natural song fit the rules of an artificial system: “We cannot 
fit wild bird songs to our standards of music” (Saunders 1916). Thus, a practical 
method was needed that discarded the “mechanical rules of human music, without 
losing any of its scientific accuracy” (Saunders 1915). Saunders proposed to 
represent the pitch and duration of a song on a schematic graph along two 
axes, which, he claimed, did not obscure variations in sound as a musical script 
would. 

His proposal met with controversy, however. A musician, naturalist, and avid 
advócate of musical notation in the American Ornithologists 5 Union, Robert Moore 
objected that Saunders’s graphical system was actually less comprehensive because 
it obscured those features of interest to the student of birdsong that musical nota- 
tion had represented far better (1915). Moore’s skepticism triggered an exchange of 
letters that led Saunders eventually to discard his opponenfs arguments as nonsci- 
entific: a [W]e must realize that it is our intention to study bird songs, not from the 
standpoint of a musician but from that of a scientist.” Moreover, “the Science nec- 
essary to the student of bird songs consists almost entirely of the physics of sound, 
not the use of technical musical terms” (Saunders 1916, 103). Saunders may have 
exaggerated here, for his graphic method still plotted sound against a musical scale 
rather than in acoustic frequencies. Still, this remark does suggest a desire to 
systematize the analysis of bird sound by recording it in a “more scientific” 
way — different from the musical techniques naturalists usually applied. 

This was a quarrel over what constituted the right scientific methodology and 
how to listen to sound. Moore seemed to recognize in the bird utterances harmonic 
relations and a distinct meter. Saunders, on the other hand, rendered pitch in 
microtones — without key or meter (figure 3.2). His graphic technique, in other 
words, was not just easier for ornithologists without any musical background to 
apply; it also provided a different way of analyzing the sound. Many such claims 
emerged around the time: some believed the quality of a bird 5 s sound should be 
compared to musical instruments, and Hunt (1923, 202), for instance, proposed to 
represent bird sounds into English phonetics “since bird sounds are essentially 
human speech-like.” Although all of these recordists stressed the need for a uni- 
form method of recording, they did not seem to agree on what it was they were 
actually listening to. At the same time, however, even when modeling bird vocaliza- 
tions on speech, these critics of a “musical” approach to birdsong did not abandon 
the common term birdsong to denomínate their unit of analysis. 

This confusión over the preferred way of recording and analyzing birdsong 
persisted well into the twentieth century. At the same time, however, some orni- 
thologists had begun to expect much of mechanical recording in their field. Writing 
in 1929, Aretas Saunders commented: “Perhaps some day we can devise a phono- 
graph that can amplify bird songs sufficiently to record those of wild birds. Then we 
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Song of the Field Sparrow. West Haven, Conn., April 18, 1914, 9 A. M. 

Figure 5.2 A graphic recording of a field sparrow song by Aretas A. Saunders. From 
Aretas A. Saunders, “Some Suggestions for Better Methods of Recording and Studying 

Bird Song,” Auk 32 ( 2 ) (1915)- 173~83> 175- 

shall be able to play the records over as much as we like and analyze the song in 
detail” (Saunders 1929, 7). Likewise, ornithologist Lucy Coffin believed that “it 
would be worth while . . . to thoroughly investigate the latest developments in pho- 
nograph recording in relation to the recording of bird songs” (1928, 99). 6 These 
quotes demónstrate a shared but not very specific optimism in the potential of 
mechanical recording for field practices. 


Mechanical Recording: 

OuT IN THE WlLD 


As early as the spring of 1929, for some the advantages of mechanical recording over 
other techniques took a more concrete form. A movie production company 
appealed to the expertise of Dr. Arthur A. Alien, an eminent scholar of bird behav- 
ior at Cornell University. The Fox-Case Movietone Corporation sought to record 
birds in the wild to accompany one of their films and asked the Cornell ornitholo- 
gists to help them approach their subjects. Eager to collaborate, the ornithologists 
found themselves confronted with a truck equipped with microphones, electrical 
amplifiers, and, most important, a “sound camera” that recorded sound waves into 
a black-and-white inscription on movie film. 7 Alien and his colleague Peter Kellogg 
saw some exciting implications of these devices for their own field studies. In hind- 
sight, as their associate at the American Museum of Natural History Albert Brand 
put it, “previous methods . . . were, at best, only makeshifts, awaiting the time 
Science should have advanced sufficiently so that faithful reproductions . . . could be 
made” (Brand 1932, 436). 8 
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In the same period, the Germán and British entertainment industries also 
sought the cooperation of ornithologists to start mechanically taking birdsong 
out of doors. Together with ornithologist Oscar Heinroth and under the supervi- 
sión of producer Ludwig Koch, the Germán record company Cari Lindstróm 
published a first bird sound book in 1935. Koch soon moved to Great Britain, 
where he set up long-term collaborations with well-known British ornithologists 
Julián Huxley and Max Nicholson and also with the Parlophone record company 
(Koch 1955). These producers relied on ornithologists 5 field expertise to guide them 
in the field, while the professional ornithologists were able to familiarize themselves 
with the latest sound-recording technologies: sound film photography at Cornell 
and Parlophone’s disc-cutting phonographs in the UK. Mechanical recording 
practices now required expertise in engineering and physics rather than musical 
theory. 

However, while Koch and his team of ornithologists and engineers continued 
recording for Parlophone and the British Broadcasting Company until the 1950S, 
the contacts between the Cornell University Ornithological Laboratory and the 
movie industry were rather short lived. Although the Cornell ornithologists thus 
were no longer able to draw on the expertise of specialized recording engineers and 
technicians, the department decided to purchase its own equipment for scientific 
research. 9 After all, Brand noted, “many of the secrets of avian life are hidden in 
an understanding of the meaning of the song.” With phonograph recordings, 
“the serious student of ornithology could study song in a way that has been impos- 
sible heretofore. 55 So, he added, “if the motion picture industry can take sound 
out-of-doors, the naturalist should also be able to do so 55 (Brand 1932, 436). 

Still, despite the availability of electrical amplification and microphones, 
recording birdsongs outdoors remained a daunting task. So much equipment was 
required that it all had to be transported in a van that was so big that “the pólice 
[would] not allow it to be left on the public highway 55 (Nicholson and Koch 1937, 
30). The sound truck carried the amplifier and múltiple microphones, as well as a 
great number of storage and dry cell batteries for the power supply. Moreover, a 
phonograph would cut the recording in wax dises. To make the needle run smoothly 
on the wax, the recordists had to soften it in a large electric oven. Finally, to make 
the needle run true on the wax, the phonograph and recording van had to be pre- 
cisely leveled so the equipment could stand upright. Indeed, the recordists them- 
selves would refer to the recording truck as a studio, similar to those at Abbey Road 
(Nicholson and Koch 1937). 

Transporting this studio into the wild was a logistic tour de forcé not only 
because of the diverse staff that had to opérate the van and its equipment but also 
to get the studio in place. Whereas the recordist could depart from the tracks to 
follow the bird to its perch, the heavy recording equipment was less flexible in use. 
Roads, paths, and trails formed a network of mobility that became vital to the logis- 
tics of nature recording but also enforced some restrictions. As Brand warned 
his readers, “It is not as simple as it would seem to get a location where there is 
absolute quiet . . . Too great proximity to a traffic road, for instance, makes recording 
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impossible” (Brand 1932, 438). Henee, the recordists 5 reliance on roads to reach 
distant areas for recording had an ironic effect: It produced noisy surroundings. 

Both Brand and Koch complained that their recordings of bird sound were 
often interrupted by a passing airplane or turned out to be unfeasible because of the 
hum of a distant highway (Brand 1932; Koch 1955). The ornithologist Max Nicholson 
had accompanied Koch on some of his recording excursions into the British coun- 
tryside and recalled how such infrastructure often drastically complicated their 
attempts at recording: 

Aeroplanes, motor-cars, lorries, trains, and motor-bieyeles combined to shatter 
the tranquility which had been so perfect a few hours before. Just as smoke 
pollution helps to swamp a town under fog, so the natural peace of the country 
was drowned under the indefinable hum of distant engines and wheels .... Until 
one has listened objectively to all these sounds coming through the loud-speaker, 
in what counts still as a peaceful retreat from the bustle of London, it is hard to 
realize what a noise-ridden world we have managed to make ourselves live in. 
(Nicholson and Koch 1937, 38) 

The comparison of noise to a fog that “swamps” and “drowns out” other sounds 
evokes a sense of its ungraspable omnipresence. These recordists 5 aversión to traffic 
noise was part of a more general resistance to all kinds of mechanically generated 
noise emerging in the first decades of the twentieth century. Historians of sound 
and noise Emily Thompson (2002) and Karin Bijsterveld (2001, 2008) have described 
significant antinoise campaigns in American and European cities between 1910 and 
1940. After 1925, the decibel standard and technical instruments such as acoustic 
meters were called upon for a more systematic and “objective” measurement and 
comparison of city noise levels. 

It is then no coincidence that those who would abate noise increasingly consid- 
ered noise a problem in this “mechanical age” (Bijsterveld 2001). Not only did 
mechanical artifaets and “machinery” generate more noise, but mechanical mea- 
suring devices or recording technology also made noise increasingly audible. 10 As 
such, the recordists 5 audible experience changed into one of a noisy and uncontrol- 
lable field site. As Koch explained, “the sensitive microphone takes up all noises, 
often within several miles 5 radius, and exaggerates them” (Nicholson and Koch 
1937, 20). At Cornell, Brand concluded that “a mechanical ear . . . transmits all the 
sounds it hears, without discrimination. Though man has been able to invent a 
mechanical ear — the microphone, and it is an extremely sensitive one, too — he has 
not been able to equip it with a brain” (Brand 1934, 19). 

A mechanical ear recorded sound not just as it was heard by humans: It also 
assumed an indiscriminate, unselective registration. “Heretofore,” Brand stated, 
“the bird song student has had to rely entirely on auditory impressions,” but with 
mechanical devices such as the sound camera, “an objective médium of study is 
now available” (Brand 1935, 192). This straightforward equation of objectivity with 
a mechanical apparatus Lorraine Daston and Peter Galison (2007) have termed 
mechanical objectivity. Until the mid-nineteenth century, they found, scientists 
called upon their experience and personal judgment to decide what to represent 
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and how to do so. Yet increasingly, scientists considered such practices of selection 
and idealizaron to be “merely subjective” projections, disturbed by an artistic 
fancy. Instead, scientists valued data that were “unmarked by prejudice, skill, judg- 
ment, or interpretation” (Daston and Galison 2007, 17)* To comply with this emerg- 
ing morale of nonintervention and abstinence, they delegated as many of their tasks 
as possible to mechanical procedures or automatic registration techniques. Because 
mechanical reproductions were believed to be guided by protocol, scientists con- 
sidered them impartial and objective. Similarly, mechanical recording assumed a 
superior means of sound reproduction than artisanal notation techniques. 

In sum, the introduction of mechanical recording changed the field of orni- 
thology in múltiple ways. It did so first in a practical sense: Instead of a naturalist 
with a good musical ear and a notebook, fieldwork now involved specific, costly 
equipment and a number of skilled professionals to opérate it. However, it also 
transformed the audible experience of the recordist: Bird sounds were suddenly 
drowned out by the hum of trafile, industry, and vacationers, qualifying the field as 
a particularly noisy and somewhat uncontrollable environment. Finally, mechani- 
cal devices redefined the understanding of a sound recording into mechanically 
mediated contení. What made a good recording was no longer what Saunders and 
Moore had called the a accuracy” of a human observer but what Brand and Nicholson 
perceived as the a objectivity” produced by the recording device. Both aimed to 
reduce interpretation from the act of recording, but while Saunders relied on a dis- 
interested observer and a nonmusical system, Brand disqualified the observer in 
favor of a “neutral recording machine.” Mechanical devices thus rendered artisanal 
techniques of recording not just inaccurate but also subjective . 

Mechanical recording partially solved a problem of faithful reproduction with 
which ornithologists had earlier struggled. At the same time, however, it also cre- 
ated a new one: the perception of noise. As such, mechanical listening generated a 
new experience of field sound. How, then, did the sound recording teams in London 
and Cornell come to deal with the presence of such unpleasant yet “objective” 
noises? 


Dealing with the Field: 

Studio Recordists and Ornithologists 


Koch and his British recording team dealt with the realities of field recording by 
employing a strategy common in the studio recording practices of the day. While in 
the field, they would adjust sound levels by a strategic placement of microphones 
and the singing bird (which sometimes had to be persuaded to perch near the 
microphone). Historian of sound technology Susan Horning (2004) has argued 
that, since the early days of sound recording, one of the most important skills 
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acoustical recordists had to acquire was to know where to position the sound source 
with respect to the recording element. Without amplification of the source, this 
tinkering in real time was the only way to adjust the balance between singer and the 
various instruments on the recording. However, even when recording turned electric 
in the 1920S and the sound quality drastically improved, the greater sensitivity of the 
microphone also demanded greater skill in placing it in relation to the sound source, 
and this expertise could be acquired only by trial and error. It was this particular skill 
that the recording engineers and technicians in Koch’s unit had learned in the studio. 
Unlike Koch, these men had little or no experience of working outdoors. 

Koch and his team outlined an approach similar to that used in studio record- 
ing when he advanced that a real birdsong hunter must “with skill and much 
patience go about the job of bringing the songster as near as possible to his micro- 
phone, or rather the microphone as near as possible to the songster” (Koch 1955, 3). 
At all times, Koch had to arrange for the microphone to be near the bird “as if he 
were a performer in a studio” (Koch 1955, 73). Despite these arrangements, how- 
ever, he also realized that the conditions of the field were considerably different 
from those in a studio. In fact, he preferred the former. “If one wishes to avoid such 
natural disturbances, then only recording in a closed room can be considered, and, 
as I have found in my own experience, in such altered acoustical conditions the 
bird-notes are distorted and no longer make a natural record” (Nicholson and 
Koch 1937, 21). The natural environment was an important asset for this kind of 
recording. As Julián Huxley repeatedly stated in his introduction to KoclTs first 
British sound book, KoclTs records had “the quality for evoking the bird’s environ- 
ment” and “a true picture of the birds’ voices.” This “natural” sound was what 
Koch was after. “In my nature recording I invariably observe the principie of get- 
ting the bird-notes in question in the foreground by all means, but always with their 
natural background” (Nicholson and Koch 1937, 21). 

However, in the given circumstances, making such a record required a few cru- 
cial interventions. The team had to distinguish between the bird’s voice in the fore- 
ground and the noisy environment in the background. To do so, the recordists 
would arrange up to six different microphones to endose the bird’s song perch, 
thus impro vising a large-scale recording room outdoors (figure 5.3). Each of these 
microphones was linked to a control panel in the van several hundred yards away, 
where an incoming sound to one of them would be cut right into the wax disc. 
Occasionally the bird would fly away, and then the whole disposition of the micro- 
phones would have to be changed, including tests to make sure the microphones 
“cut out as much as possible of interfering noises” (Nicholson and Koch 1937, 37). 
These quotes make it clear that while the recordists considered noise part of 
the reality of field recording, the intended purpose of the recordings also dictated 
that interference was to be reduced as much as possible. Furthermore, as Sterne 
(2003) shows, the act of recording always implies mediation. To make a “natural- 
sounding” recording required at least some engineering and staging of the recorded 
sound, guided by an implicit preference for how nature should actually sound 
(also see Helmreich, this volume). 
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Figure 5.3 Parlophone recording van and setting up several microphones. 
From Max Nicholson and Ludwig Koch, More Songs ofWild Birds 

(London: Witherby, 1937, 5). 


On the other side of the Atlantic, the recordists at the Cornell Laboratory of 
Ornithology were facing similar problems but eventually took a rather different 
approach. After experimenting for some months with the recording equipment, the 
ornithologists experienced some success but also had to acknowledge that noise 
drowned out most of the recorded sounds. “If we were to achieve any kind of 
perfection,” Kellogg recounted, “we would have to develop our own specialized 
equipment and techniques for fieldwork” (Kellogg 1962, 38). The group allied 
with Corneü’s electrical engineering department to rebuild its equipment for this 
purpose. In addition, after some experimentation, the group manufactured a 
parabolic reflector that became an important instrument for the ornithologists 
at Cornell. According to Kellogg, the “parabola for picking up and concentrating 
bird songs on the microphone was probably our greatest piece of good fortune. 
Recording would have been possible without a reflector, but the results with it were 
so superior as to make the instrument a universal tool in this field” (Kellogg 
1962, 39). 

Although Kellogg and Alien would use the reflector for all of their expeditions, 
the instrument was not quite yet universally used, but parabolic reflectors did 
become more common in the 1950S. By then, the microphone could be combined 
with a tape-recording setup — Kellogg helped design the Magnemite model, which 
would become the standard recorder for the next decade. The tape recorder pro- 
vided an acceptable semiportable alternative to the heavy sound camera or disc 
recorder. Since the 1920S, this costly and complex equipment had made sound 
recording the almost exclusive preserve of professionals, but the tape recorder 
and parabolic reflector made field recording more affordable and easier to do, 
attracting also amateur recordists with an interest in natural sound (Kellogg 1962). 
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Especially in the first years of the laboratory’s Library of Natural Sounds, Kellogg 
would instruct amateur recordists on the proper use of the equipment made 
available to them. 11 In this way, parabolic recording could indeed be successfully 
implemented as a standard in field recording. 

But what then, made the parabolic reflector so superior to the ears of the Cornell 
ornithologists? The principie was such that the surface of the parabola reflected 
sound waves to a dynamic microphone at its focal point (figure 5.4). Focusing the 
sound waves like this drastically increased the input to the recording equipment. As 
a result, the reflector would help to record sound at a much greater distance than 
microphones in an ordinary setup. Consequently, bird sounds could be recorded at 
distance, in flight or in inaccessible places — an added advantage to students of 
animal behavior in the wild. Equally important was that directional recording 
amplified only those sounds at which the recordist directed the parabola, which 
permitted a greater selectivity in recording. Albert Brand explained that “the out- 
side noises are very nearly shut out, while the sounds wanted are greatly increased” 
(Brand 1934, 23). As an article in Science News Letter read, it was a way to “overeóme 



Figure 5.4 Focusing a parabolic recorder. From Albert R. Brand, Songs ofWild Birds 

(New York: Nelson, 1934, 2). 
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much of the handicap imposed by lack of soundproof studios where the wild birds 
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Recordists could more easily elimínate interference from other ambient 
sounds. By these means both the Cornell ornithologists and Koch’s team attempted 
to minimize noise interference on their recordings. The parabolic reflector, 
however, did enable a differ kind of control. While the natural environment 
featured prominently in the background of Koch’s recordings, the parabolic reflec- 
tor technique excluded most other sounds. It produced a close-up recording: a 
sound that was sonorous and had a high signal-to-noise ratio but provided little 
information about its context. The analogy of binoculars or a gunsight mounted on 
the reflector is especially useful here. Just as the binoculars at the same time frame 
and magnify a distant image, so the parabolic microphone intensifies only those 
distant sounds that occur within its limited and directional scope. The technique 
had an interesting impact on the recording: depending on the level and proximity 
of ambient noise, it created a “sterile” background — indeed, a sonic sterility that 
seems typical of the soundproof studio that carne into being in the 1930S, 
a space that constituted clear and controlled, direct and nonreverberant sound 
(Thompson 2002). 

The parabolic reflector produced a kind of sound studio that was different 
from Koch’s approach. More specifically, it also seemed to “laboratorize” the 
recording in a way that evokes Gieryn’s schematic distinction between the 
laboratory and the field site. For the parabolic reflector was not only used to 
reduce some of the ambient noise in the recording. By reinforcing only those 
sounds at which the reflector was aimed, it also isolated and detached them 
from their surroundings, enforcing an aesthetic of the individual. Although in prac- 
tice it may have been hard to elimínate all superfluous sounds from the recording, 
the technique was different from Koch’s in that it did not aim to record the bird 
against a natural background but to catch the sound of the bird and nothing but 
that. These recordings revealed little about the local context. They did not evoke the 
bird’s environment in the way ornithologist Julián Huxley appreciated Koch’s 
recordings; they were not created to have properties such as the “fullness, immedi- 
acy, and emotional completeness” of an aural immersion (Nicholson and Koch 
1936, xiv). 

This is of course not to suggest that KoclTs recordings could not sound sono- 
rous or cióse up. The point is that the parabolic recorder was designed to create 
close-up recordings within a panoramic, immersive, and noisy natural soundscape. 
In effect, bird sound was mechanically detached from a specific context. Sound 
recording was physically detached by enabling a greater distance in the field between 
the recordist and the singing bird. However, by suppressing contextual sound, it 
also detached the sound by making it resound “without perspective” or sound as 
though it were “coming from nowhere.” It is this particular way of crafting a sound 
by suppressing its context and surroundings that enhances sonic sterility and frames 
recorded natural sound in a “laboratory” way. 
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To further conceptualize this “laboratorization” of the field, an essay by art 
historian Svetlana Alpers (1998) is revealing. In exploring an analogy between the 
artist’s studio and the scientific laboratory, she regards both not only as physical 
workplaces of artists and scientists but more fundamentally also as an instrument 
and cc a frame of mind.” Her discussion of the relation between the landscape and 
the artistic studio is especially noteworthy here. In an accepted historical account of 
nineteenth-century landscape painting, painters freed themselves from studio 
conventions and began painting “real landscapes” instead. However, Alpers 
claims, whether painted outside or inside, their work was not necessarily studio free. 
Even in landscape painting, she finds, a landscape found outdoors could still 
be transformed into what seemed to be an object composed in the studio: Even 
outside at the field site, the landscape could be represented as a still-life motif, 
reducing the expanse of the landscape to the static and focused presence of an 
object. 

I do not intend to evalúate this observation on its art historical merit but use it 
heuristically to highlight two things. First, following Alpers’s account, the “studio 
frame” can be operated as effectively by the artist in the field as it can by the studio 
painter. Likewise, a “laboratory frame” need not be restricted to experimental 
cultures within laboratory walls. Even a contingent, often distorted and hard-to- 
control field site may be materialized, conceptualized, or presented as a space that 
incorpora tes laboratory-like features (Bont 2009; Gieryn 2006; Kohler 2002). 
However, as the case of parabolic recording demonstrates, such features may also 
remain very much implicit and hidden in the material conditions of recording. 
Second, Alpers’s discussion of landscape painting calis for an analogy to wildlife 
sound recording. The landscape painters, whether or not adopting a studio frame 
in their work, translated a three-dimensional surrounding onto a fíat canvas. 
Similarly, the directional way of parabolic sound recording seems to position the 
listener infront ofa framed auditory canvas rather than in it. Like the painting and 
the observer, the design of the sound recording does not immerse the listener. 

Intriguingly, it is difficult to articúlate the effect of directional recording with- 
out resorting to visual analogies such as “cióse up,” “perspective,” “focus,” or 
“frame.” Of course, this is not to suggest that ornithologists themselves consciously 
employed such analogies to develop this recording technique. Rather, this example 
puts into a critical perspective what media historian Jonathan Sterne once dubbed 
an “audiovisual litany” (2003, 18). Many theorists and historians of sound, Sterne 
claims, have taken refuge in a transhistorical idealization of the nature of sound. In 
this customary conception, only hearing puts the individual into cióse contact with 
the world; it is essentially immersive, subjective, and affective. This has been con- 
trasted to visión — generally considered directional, enhancing distance and objec- 
tivity. The history of parabolic recording, in which hearing is not immersive but 
directional, distancing, and objectifying, warns against a conception of sound and 
hearing that emphasizes its putative transcultural and transhistorical features (see 
also Smith, this volume). 
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Analyzing Sound: Listening with 

Eyes and Hands? 


Mechanical recording changed not only the way ornithologists in the field experi- 
enced bird sound but also how they treated it. Yet, although mechanical devices 
provided a way to experience and record sound more “objectively” and to deal with 
the consequences of such “faithful reproduction,” it did not do away with the initial 
problem of how to describe or even analyze bird sound when out of the field (Daston 
and Galison 2007). In a paper titled CÍ Why Bird Song Can Not Be Described 
Adequately” (1937), Brand relates how listening to his own phonograph records 
would stir discussion with Dr. Alien. Although both were experienced students of 
bird sound, they each heard different resemblances between the vocalizations they 
compared. However “objective” the recording might be when played in the lab, he 
concluded that “the subjective reaction of the listener is much the same as in the 
open” (Brand 1937, 14). 

Brand had personally witnessed a series of experiments at Cornell that demon- 
strated that considerable “fading points” existed in the hearing spectra of most 
people. Moreover, listening was shown to have a significant psychological factor: 
“[W]e hear what we are listening for and what we expect to hear,” for “it is impos- 
sible to sepárate the hearing apparatus from the thinking mechanism” (Brand 1937, 
12). If earlier Brand regretted not being able to equip the mechanical ear with a 
brain, he now suspected that the brain permeated human hearing with the flaw 
of individual perception. To Brand these experiments proved that it was not 
just recording but also hearing that was to be mistrusted: Listening after all was a 
subjective and individual experience (Brand 1937, 12). 

In contrast, he advanced recording on sound camera film as “a médium from 
which sounds can be studied objectively” (Brand 1938, 63). The camera would 
reproduce an image of recorded sound that, once developed, could be analyzed 
under a microscope. This way, the “extremely short notes and those of very high 
pitch, often inaudible to the human ear, can be clearly seen and studied” (Brand 
1935, 4o)- In an experimental test in 1938, Brand and one of his students, Harold 
Axtell, used a sound film recording to determine that it took an accomplished musi- 
cian about four hearings to correctly estimate the notes of an ordinary utterance 
(1938). Their finding suggested that some of the observations Saunders (1929) had 
collected in what still counted as an authoritative study might actually be up for 
revisión. Moreover, the implication of this juxtaposition of the human listener and 
the visualization by mechanical means was that listening to sound was more 
difficult than looking at it. 

Although this did not keep ornithologists, including Axtell (1938) himself, from 
recording bird sounds by ear, in the following years a series of mechanical devices 
for the visualization of sound, such as the “Vibralyzer,” the oscillograph, and the 
audiospectrograph, were introduced into the analysis of bird sound. 13 From the 
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early 1950S onward, these technologies would provide what was (again) considered 
to be “objective data that are more detailed and accurate than those obtained by 
most methods heretofore used” (Borror and Reese 1953, 276). Most of these tech- 
nologies had been developed during wartime experiments to detect the sounds 
emitted by enemy submarines. Yet after the war, more peaceful applications, such 
as the analysis of animal sounds, were quickly recognized (Potter, Kopp, and Green 
1947). Eventually, one of these technologies, the audiospectrograph, even inspired 
some to compare its valué for the Science of bird sound to that of the microscope 
for (cell) biology (Marler and Slabbekoorn 2004; Baker 2001). 

Why did the audiospectrograph become such a success? One way to account 
for this would of course be to understand the device as another stage in the develop- 
ment of mechanical recording — technology now also contributed to sound analy- 
sis. Additionally, the audiospectrograph connects to other recording technologies 
and techniques in that it inscribes sound. Such inscriptions — nonmechanical such 
as musical scores or mechanical such audiospectrograms — we could cali with Bruno 
Latour (1986) “immutable mobiles.” Like the printing press, the recording device 
makes mobilization and immutability possible at the same time by casting many 
identical copies of a unique sound event and enabling those to be circulated widely. 
By conserving, multiplying, and displacing sound, a recording challenges what may 
be called the “ephemerality” of sound, that is, its uniqueness in time and place. 
Unrecorded sound “evaporates,” while recorded sound is detached from the field 
and made available for others to be repeated at will. Such mechanical inscriptions 
are not necessarily limited to visual technologies such as the sound camera or the 
audiospectrograph. Yet, if with Brand one finds that listening to sound may 
be deceptive and more difficult than looking at sound, what advantage did the 
audiospectrograph then provide? 

According to Latour (1986), inscriptions are not just immutable and mobile. 
They may also be fíat and are thus more easily cut up, recombined, compared, or 
superimposed — in short, dominated. An audiospectrographic comparative analy- 
sis of Western meadowlark songs by Lanyon and Fish (1958) demonstrates how fíat 
and printed inscriptions work differently from sonic inscriptions. The article pres- 
ents a píate of four spectrograms of a Western meadowlark cali note (figure 3.5). 
The calis were recorded in different field locations from California to Wisconsin. 
Referring to the combined spectrograms, the authors concluded that all notes were 
identical. They then stated that “ collectively , [the spectrograms] suggest that Western 
meadowlarks render the same characteristic cali note regardless of their geographi- 
cal location” (Lanyon and Fish 1958, 340). 

This quadruple comparison of evidence is Lanyon and Fish’s central argument 
(Lynch 1988), made possible only because the original recordings were processed 
into inscriptions, which can be juxtaposed and combined especially because 
they are fíat. To understand its valué, one has only to imagine the same fourfold 
presentation of the original recordings but played out loud simultaneously. In 
comparison, two-dimensional, printed inscriptions can be made to “tune” only 
when the observer looks at them — one at a time or all at the same time. 
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Figure 5.5. Audiospectrographic analysis of geographic variation of Western 
meadowlark calis. From Wesley Lanyon and William Fish, “Geographical Variation 
in the Vocalization of the Western Meadowlark,” Condor 60 (1958): 339-41, 340. 

Courtesy Cooper Ornithological Society. 


Henee, the Meadowlark inscriptions do something besides representing sound, 
in Latour’s words (Latour 1986), “synoptically.” They enabled Lanyon and Fish to 
deal with the sequentiality of sound. Aíter all, sound is not only tied to a specific 
time and place (it is ephemeral) but also happens “in time” (it is sequential). Even 
when a recorded bird sound is played again, every note is evidently replaced by the 
next and the next. Thus, where auditory analysis necessarily takes place in time and 
thus takes time, the audiospectrograph stabilizes time through combination and 
scaling. Spectrographic analysis, in other words, allows one to go back and forth 
and cut across sound sections. Recorded tape, on the other hand, may be easy to 
circuíate, but it is cumbersome and time consuming to select, cut, paste, listen, 
rewind, and listen. 

The analogy to the microscope may lead one to mistake the new “objectivity” 
of the audiospectrograph for a purely visual matter — just as the phonograph 
was not more “objective” simply because it was mechanical. Not just a mechani- 
cal inscription machine, then, the audiospectrograph seems to represent another 
phase in a process of gaining control over sound — a process that started with 
musical notation Scripts. An explanaron for the authority of inscriptions should 
not be sought in the single visual inscription but in a continuous process, a cas- 
cading of inscriptions. The audiospectrograph does not make other recording 
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technologies obsolete. Instead, it processes recorded sound further into even flatter 
and more controllable, comparable units of sound. 

Crucially, visualization enabled ornithologists to deal with the dimensión of 
time in a more felicitous way. However, it also permitted them to deal with the 
problem of noise, by erasing it visually. An article by Marler, Kreith, and Tamura 
(1962) signáis a specific approach to visual noise. After their parabolic recordings 
had been processed by the audiospectrograph, the origináis had again been traced 
by hand in pen and ink (figure 5.6). In another article that same year, the authors 
indícate that their images are in fact “photographs of sonograms which were 
retouched with white paint to mask inscriptions and some background noise” 
(Marler and Tamura 1962, 369). 

Both the parabolic microphone and the spectrogram were thus geared 
toward the erasure of noise from the recording. Ornithologists processed recorded 
sound into clean and sterile representations by manually eliminating the mechani- 
cally recorded noise. What Michael Lynch (1988) observed to be sequential trans- 
formations of an original image into a presentable image then concerns the 
whole process of sound recording. It is not just visual displays that are made more 
coherent with each other: Sound recordings — audible and visual — are schematized, 
simplified, and marked more clearly. In this process of schematization, tracings of 
the field conditions disappear; place-specific sounds and noises make place for a 
generic — white and mute — background. 

Mechanical recording categorized noise as a concrete problem — rustle on a 
recording or a few printed spots. Of course, noise did not come into existence with 
mechanical recording, but it certainly did turn problematic. As naturalists heard 
“what they were listening for,” precluding wind, foliage, and trafile, the “indiscrim- 
inate and objective” mechanical ear of the microphone had begun to guide a differ- 
ent perception of the environment. Yet in whatever way noise was experienced, 



Figure 5.6. Ink tracing by hand of an audiospectrogram. From Peter Marler, Marcia 
Kreith, and Miwako Tamura, “Song Development in Hand-Raised Oregon Juncos,” 
Condory 9(1) (1962): 12-30, 15. Courtesy Cooper Ornithological Society. 
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both mechanical and nonmechanical inscriptions remained silent about it. This 
does not mean that the field site itself became meaningless. Paradoxically even, the 
detailed and time-economic analysis of sonograms at the same time also enabled 
the geographical variation in birdsong to become a topic of growing interest (Baker 
2001). However, while the field since the early decades of the twentieth century 
gained relevance as a place where ornithologists could observe the natural behavior 
of birds, it also emerged as a place that could be actively controlled and silenced. 
The impact of this conception of the field site on bird sound recording is best illus- 
trated by the fact that the influence of a noisy habitat on bird behavior has only 
recently become an important topic of research. In the last two decades, researchers 
have begun paying systematic attention to changes in birds’ singing behavior 
because of interference from city or traffic noises (Slabbekoorn 2004). 


Conclusión 


In this chapter I have explored ways in which ornithologists appropriated and 
redefined listening and recording as meaningful modes of acquiring scientific 
knowledge in the twentieth century. By analyzing two periods of time, I investi- 
gated how ornithologists made conscious decisions on what to record and how 
to record it. These decisions were structured by recording techniques and techno- 
logies. As listening became entangled with recording, editing, measuring, and 
reading, the parabolic microphone and the spectrograph represented cascading 
technologies of increasing control over sound, exemplified by a silenced and white 
background. It is through this sterile environment that the field site connects to the 
laboratory, as unwanted sound is intentionally eliminated. 

At the same time, the point at which the field ends and the lab begins became 
blurred. In 1950S, for instance, ethologists Bill Thorpe and Robert Hinde (1956) at 
Cambridge University fashioned the construction plan of their ornithological field 
station in Madingley after a standard soundproof room. Sound insulation enabled 
them to do their experimental work on the learning of birdsong, they said, without 
too much intrusión from outside noises. Insulating the field station against 
unwanted sound, Thorpe and Hinde seem to merge the strategy of a modern labo- 
ratory with that of the modern studio (Thompson 2002). As Kohler (2002) has 
argued, a field station blurs the categories of laboratory and field. It becomes a way 
of drawing on the unique characteristics of the field site, while simultaneously 
enforcing a laboratory-like control over its contingencies. Indeed, also at Madingley 
a lablike control of sound was focused on particular frequencies of sound for spe- 
cific experiments, while purposely disregarding others. 

Like the Madingley soundproof room, the recording techniques analyzed in 
this chapter blur the boundaries between field, studio, and laboratory. They merged 
these categories not only by introducing a mobile studio in the field but even more 
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intricately through the parabolic microphone and audiospectrogram. While deal- 
ing explicitly with sound located and recorded in the field — whether in New York 
or California — these instruments fabricated a crisp, controlled, and context-ridden 
sound. These features may be associated with the sonic control of a studio, but they 
can also be integrated in the “passionless” crafting, generic context, and detached 
approach of the scientific laboratory. These mechanical instruments ¡Ilústrate a sci- 
entific culture in which places of Science are demarcated by sterility and silence and 
underscore the need to understand how this scheme is enforced by and reinforces 
wider cultural expressions of modern sound control. 

NOTES 


1 I would like to thank Karin Bijsterveld, Tessa Fox, Fabian de Kloe, Julia Kursell, 

Mara Mills, Trevor Pinch, and Alexandra Supper for their feedback on earlier draít. 

2 “Blind Tom” refers to an African American musical savant who lived in the latter 
half of the nineteenth century. In this context, he exemplifies the exceptionally skilled 
listener. 

3 At that time, in other scientific disciplines the potential of phonograph recording for 
fieldwork was also acknowledged: Radick (2000) describes the recording of captive and 
wild primate sounds on wax cylinder and their use in evolutionary philology and biology 
since 1892. In addition, Stangl (2000), Ames (2003), Brady (1999), and Sterne (2003) 
discuss the use of phonography for ethnology, comparative musicology, and anthropology 
since the late nineteenth century. 

4 In the United States and especially in Britain, tenured positions in behavioral 
biology were limited. Most work in ornithology was performed by amateur naturalists 
(Burkhardt Jr. 2005). 

5 Before the late nineteenth century, so-called armchair natural history was most 
dominant. Ornithological study consisted mainly of taxonomic classification based on 
museum collections of specimens. 

6 Although recordings of captive birds’ songs had been available as early as 
the late nineteenth century, they were of little interest to Saunders and other field 
ornithologists. These recordings, often of talented songsters and mimics, were intended 
for entertainment, not for naturalistic research (Copeland, Boswall, and Petts 1988; 

Brand 1934). 

7 The photographic sound recorder transforms sound waves into electrical energy, 
which makes a small glow tube flicker. The light, which passes through a small air slit, 
triggers a Chemical change in a passing film. Thus being exposed at a constant speed, the 
film produces a black-and-white inscription that can later be played back as sound again. 
For a technical history of photographic sound recording, see Kellogg (1955). 

8 Emphasis mine. 

9 Even though the Cornell ornithologists intended their recordings primarily for 
scientific use, between 1932 and 1936 they also published a series of gramophone records 
of wild bird songs that were taken from their original film recordings. The royalties of 
these popular recordings helped to finance further study and recording of birdsong 
(Kellogg 1962 ). 

10 Natural noise, such as wind, water, rain showers, foliage, and other birds calling, 
provided an equally challenging soundscape to work in. 
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11 See, for instance, Stillwell (1964). 

12 “Reflectors like Airplane Detectors Catch Bird Songs,” Science News Letter (Nov. 10, 
1934): 299-300. 

13 However, not all of these technologies visualized sound in the same way. They 
represented recorded sound on a time scale, indicating amplitude but not frequency or 
pitch. The sonagraph, on the other hand, represented recorded sound also on a time scale, 
indicating frequency and giving some impression of amplitude. See also Rieger (2009). 
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CHAPTER 6 


UNDERWATER 
MUSIC: TUNING 
COMPOSITION TO 
THE SOUNDS OF 

SCIENCE 


STEFAN HELMREICH 


Introduction 


How should we apprehend sounds subaqueous and submarine? As humans, our 
access to underwater sonic realms is modulated by means fleshy and technological. 
Bones, endolymph fluid, cilia, hydrophones, and sonar equipment are just a few 
apparatuses that bring watery sounds into human audio worlds. As this list sug- 
gests, the media through which humans hear sound under water can reach from the 
scale of the singular biological body up through the socially distributed and techno- 
logically tuned-in community. For the social scale, which is peopled by submari- 
ners, physical oceanographers, marine biologists, and others, the underwater 
world — and the undersea world in particular — often emerges as a “field” (a wildish, 
distributed space for investigation) and occasionally as a “lab” (a contained place 
for controlled experiments). 

In this chapter I investígate the ways the underwater realm manifests as such a 
scientifically, technologically, and epistemologically apprehensible zone. I do so 
by auditing underwater music, a genre of twentieth- and twenty-first-century 
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composition performed or recorded under water in settings ranging from swim- 
ming pools to the ocean, with playback unfolding above water or beneath. 
Composers of underwater music are especially curious about scientific accounts of 
how sound behaves in water and eager to acquire technologies of subaqueous sound 
production. We can learn much about how the underwater domain has been made 
sonically perceptible by attending to how composers adapt their practice to scien- 
tific language and technique in ways both rigorous and fanciful. We can learn how 
sound has been abstracted from the water médium to reveal and produce resources 
imagined as musical. We can track how technologies of underwater audition 
are often adjusted to deliver aesthetic experiences in line with the way composers 
imagine submerged sound should sound; how, to take one example, the notion of 
water as sublimely immersive can be reinforced in compositions that make use of 
hydrophonic listening and playback. We can also sometimes discern a querying 
of dominant thinking about the symbolism of underwater sound. 1 

One tradition in the history of sound tells us the ocean was once taken to be a 
place of silence (thus, in 1953, Jacques Cousteau’s book The Silent World). Auguste 
and Jacques Piccard the same year described travel two miles down in their bathy- 
scaphe Trieste as surrounded by the “quiet of death” (Long 1953, quoted in Schwartz, 
forthcoming). That tone had been set in 1896, when Kipling wrote in his poem “The 
Deep-Sea Cables/’ “There is no sound, no echo of sound, in the deserts of the deep” 
(resonating with early nineteenth-century theories of the deep as a lifeless “azoic 
zone”). However, there has existed a more sonorous imagination of the sea — think 
of singing mermaids and sea monsters. In Charles Kingsley’s 1863 novel, The Water- 
Babies , the boy protagonist, approaching a submarine volcano, comes a to the white 
lap of the great Sea-mother, ten thousand fathoms deep . . . aware of a hissing and 
a roaring, and thumping, and a pumping, as of all the steam engines of the world at 
once” (quoted in Kaharl 1989, xiii). As we will hear, the underwater world was, even 
in its first scientific manifestations, full of sound — even music — echoed in the 
poetic descriptions such vibration often called forth. Such soundful seas found 
expression in Romantic musical efforts to evoke underwater realms, which 
bequeathed a store of symbolism to later music meant to be realized under water. 
Notions of the immersive and sublime continué to satúrate audio work. However, 
listening closely to such work, as this chapter does, also reveáis how underwater 
music tracks shifting perceptions of the sea (from a space of cold-war mystery to a 
commons imperiled by global warming), changing ways of inhabiting swimming 
pools (primarily implicating gender), and fashions of connecting sound, art, 
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careful readings, I also thank Etienne Benson, Caitlin Berrigan, Beth Coleman, Kieran 
Downes, Douglas Kahn, Ernst Karel, Heather Paxson, Tara Rodgers, Sophia Roosth, Michael 
Rossi, Hillel Schwartz, Nick Seaver, Malcolm Shick, Nicole Starosielski, and Peter 
Whincop. 
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and Science in contemporary practices such as field recording, sampling, and 
sound art. 


Evoking, Invoking, Soaking 


I distinguish three modes through which music meets water. In the first, musical com- 
position or performance evokes water symbolically, metaphorically, or timbrally — in 
the arrangement of notes, the organization of rhythms, or the choice of instrument. 2 
In the second, music invokes water as a material instrument or sonic element. In the 
third, music soaks in water — that is, music is immersed in actual water as an encom- 
passing médium within which it is performed, recorded, played back, or listened to 
(see the end of the chapter for a musicography organized according to evoking , 
invoking , and soaking). This chapter concentrates on the third mode. 

Before focusing on soaking, however, here are some notes on evoking and 
invoking: 

Evoking: Romantic composers Berlioz, Debussy, and Ravel, along with mod- 
ernists Satie and Schoenberg, are known for portraying tranquil and tumultuous 
seas with an orchestral palette. 3 Acoustic ecologist R. Murray Schafer tested the 
waters of scientific seas in 1978 with String Quartet no. 2, Waves , which offered 
“dynamic, undulating wave patterns, the rhythm and structure of which [were] 
based on his analysis of wave patterns off both the Pacific and Atlantic coasts of 
Cañada” (Knight 2006, 58). 4 The undersea world carne into palpability in sound- 
tracks to Science documentary and Science fiction film and televisión. Cousteau’s 
The Silent World (1956) — not at all silent (bubble noises and scuba breathing 
abound) — features an Yves Baudrier score, towed down by dropping cello lines, 
suspended with gurgly horns, and buoyed up with tinkling harp notes (indebted 
to Liszt’s 1877 “The Fountains of the Villa d’Este,” full of rippling piano 5 ). 6 In sound- 
tracks to 20,000 Leagues under the Sea (Paul J. Smith 1954) and The Deep (John 
Barry 1977), composers evoke the undersea with arpeggiating harps and minor- 
keyed swelling strings. In line with apprehensions of the ocean as a feminized, mys- 
terious other, Western composers often employ orientalist motifs. In Angela 
Morley’s 1969 “Martineau and Organ,” from Captain Nemo and the Underwater 
City , the Mellotron organ and the Theremin evoke mellow swirls and half-forgotten 
siren songs. A lexicón for music evocative of the underwater world comes into 
being. 7 

In the mid-twentieth century, composers move from iconic to onomatopoetic, 
seeking to create sounds that sound as though they origínate in water. Electronic 
effects become important. 8 Ussachevsky’s 1951 “Underwater Waltz” employs 
reverberated piano. Synthesizers burble onto the scene: For Jean Painlevé and 
Genevieve Hamon’s 1965 film, The Love Life ofthe Octopus , musique concréte com- 
poser Pierre Henry offers oozy synthesizer noodlings. 9 The electric guitar is adapted 
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for surf music. The “sopping-wet ‘surf sound” (Bogdanov 2001, 105), realized by 
cc [l]iquid guitar drenched in deep-tank reverb” (Priore 2007, 72), results from 
reverberation: “Surf guitarists are noted for extensive use of the ‘wef spring reverb 
sound and use of the vibrato arm on their guitar to bend the pitch of notes 
downward .” 10 Jamaican dub drips reverb. David Toop suggests that “sonar 
transmit pulses, reverberations and echoes of underwater echo ranging and bioa- 
coustics” constitute the “nearest approximation to dub” (1995, 116; cf. Henriques 
2003). In the late twentieth century, Detroit techno outfit Drexciya conjured an 
imagined underwater sonic universe. Their homage to Kraftwerk’s “Autobahn,” 
“Aquabon,” guides Germán electrónica into a disturbing fantasy of a black 
Atlantis founded by Africans thrown overboard during the Middle Passage . 11 
Important in these works is how the underwater world is imagined, for this imagi- 
naron remains influential when composers work under “real” water as they coax 
out of the médium that which they imagine they should hear. Underwater sound- 
worlds are unearthly, evanescent, all encompassing, dreamlike, alien . 12 Aesthetics 
saturates technique. 

Invoking: Douglas Kahn’s Noise Water Meat is the essential scholarly work 
about modernist music that calis upon water. “The first notable use of wet percus- 
sion was Erik Satie’s use of boutelliphone (a series of tuned bottles suspended from 
a rack, c a poor man’s glockenspieP) in Parade (1918)” (Kahn 1999, 247). 13 John 
Cage’s 1932 Water Music “included among its forty-one events a duck whistle 
blown into a bowl of water and two receptacles for receiving and pouring water” 
(Kahn 1999, 242). The genre of “drip music,” starting in the 1950S, dipped into non- 
sense, babbling . 14 However, “Drops of water were [also] conducive to music because 
they could comfortably assume musical speeds and were amenable to total organi- 
zation by the composer” (Kahn 1999, 251), as in Hugh Le Caine’s 1955 “Dripsody: 
An Etude for Variable Speed Recorder.” Yoko Ono’s 1963 “Water Piece” (followed 
by her 1971 “Toilet Piece”) keys us into the context for drip music, the art move- 
ment known as “Fluxus,” Latín for “flow .” 15 In the late 1960S, flows and drips — 
artificially created in laboratories of modernist aesthetics — were joined by a new 
genre of field recordings. In 1966-1967 New Zealand composer Annea Lockwood 
began a project called “The River Archive,” recording sounds of rivers around the 
world. 

The aesthetic aim in pieces that evoke and invoke water is a sense of immersion 
(cf. Grimshaw, this volume), and one purpose of this chapter — especially as I turn 
in the remainder of this text to underwater music that soaks — is to think critically 
about how that immersion is achieved in controlled lablike spaces such as swim- 
ming pools and in the wilder field of the ocean. It turns out that immersion is 
accomplished through composers’ appropriation of scientific and technical models 
of underwater soundworlds, as well as their tweaks of those models to align with 
ideas about how the underwater domain should sound. If the sounds of Science 
(Mody 2005; Pinch and Bijsterveld 2004) satúrate underwater music, these sounds 
are multiply mediated and manipulated. 
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Tacking between Field and Lab in 
Underwater Music: Submarine Noises 

AND WhALE SoNGS 


Musical language shapes descriptions of early experiments in underwater sound 
propagation. In a 1708 issue of the Royal Society’s Philosophical Transactions , Francis 
Hauksbee published an “Account of an Experiment Touching upon the Propagation 
of Sound through Water,” in which he pronounced that a bell under water sounded 
“much more mellow, sweet, and grave at least three notes deeper than it was before” 
(1708, 372). 16 Imagery of music played under water reverberates through early 
technoscientific inquiries into submerged sound. 

By the early and mid-twentieth century, as oceanographers and antisubmarine 
warfare researchers listened closely to the underwater realm — realizing that it was 
not a place of silence — music and its metaphors continued to shape sea sound 
description. Maritime military research history entwines with musical history: 

In World War I the composer and conductor Sir Hamilton Harty was called in by 
the British Admiralty’s Board for Invention and Research to identiíy the most 

likely frequency bands of hull and propeller noises Ernest Rutherford also 

took a colleague with perfect pitch out in a small boat as part of the war effort. At 
a prearranged spot one of the great ñames in atomic physics took a firm grip of 
his companion’s ankles while this man stuck his head into the Firth of Forth and 
listened to the engine note of a British submarine. Hauled back into the dinghy 
and toweling his head he announced it was a submersible in A-flat. (Hamilton- 
Paterson 1992, 114-15) 

Submarine pilots used less fanciful discernment. The U.S. Navy created instruc- 
tional LPs to train submariners to distinguish enemy sub sounds from ambient 
noise. 17 “Still,” writes Hillel Schwartz: 

Sonarmen went “ping-happy.” Straining to identiíy threats within an underwater 
environment that behaved “very much like a large empty room with bad acoustic 
properties,” they heard pings bouncing off what turned out to be whales and 
schools of fish, heard pips refracting off what turned out to be temperature 
gradients, heard roars from what turned out to be waves rushing at rocks on 
distant shores, and heard much better in mid-morning than in the late afternoon. 
(forthcoming) 

In order to endow submarine space with immersive sonic depth, to carve a 
soundscape for humans out of the subaqueous milieu, it takes technical and 
cultural work and translation. Equipment must be constructed that can capture 
submarine vibrations in the audio register and ready them for humans to listen 
to — equipment like hydrophones, which can capture underwater vibrations using 
microphones fashioned of ceramic or some other material sufficiently denser than 
water to allow propagating waves to be impeded. The earliest hydrophones were 
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manufactured in 1901 by the Submarine Signal Company of Boston, which imag- 
ined “a network of underwater bells whose sonorous gongs would carry through 
the water at great distances” (Schlee 1973, 246). The company, seeking an alternative 
to foghorns, built receivers to capture underwater bell sounds for listeners on sur- 
face ships — though plans to use bells for Morse code were scuttled by the turbu- 
lence of the submarine médium. Hydrophones carne into their own on submarines 
with sonar (sound navigation and ranging), which was in operation by the 1930S. 
Sonar works by bouncing signáis off the ocean bottom or other boundaries in water, 
permitting submariners to time echoes to compute distances. It produces a dimen- 
sional portrait — not so much a soundscape as a soundedscape. Hydrophonic signáis 
were rendered into stereo by the use of devices that transformed signáis arriving at 
sepárate underwater receivers into “binaurally centered” impressions in head- 
phones, creating spatial relations meaningful to hearing humans (Hóhler 2003). If, 
as Emily Thompson (2002) has argued, the soundscape of modernity is patterned 
by sounds fed through technological filters, underwater soundscapes do not exist at 
all for humans without such filtering all the way through. Sea becomes part lab. 

It was incumbent on early submariners to be attentive auditors of sonar, and it 
was through such listening that the crackling of crustaceans and the snapping of 
shrimp were disclosed, providing a portrait of soundscapes already in existence for 
underwater creatures with means to hear them (see Iselin and Ewing 1941). Such 
human listening participated in “field” Science in its canonical form — the investiga- 
ron of a space of shifting boundaries, of natural and cultural agents (Kohler 2002; 
see also Bruyninckx, this volume). The care with which submariners listened 
emerged in part from cold-war anxieties about the possibility that missing the 
faintest signal might be disastrous. 

It was through such field listening — filtering “noise” from sound, tuning it to 
the human auditory range — that whale “songs” were discovered. Biologist William 
Schevill of Woods Hole was the first to cali these sounds whale “music,” though 
musical metaphors — and comparisons to birds — circulated earlier. Once whale 
sounds had been separated, they were aestheticized as lonely, majes tic, ecologically 
tuned-in arias to the wounded sea (see Madsen 1999, 33). They were pressed onto 
LP, notably by bioacoustician Roger Payne on Songs of the Humpback Whale , in 
1970. They were fused with classical composition in Alan Hovhaness’s 1970 And 
God Created Great Whales , a mix of recorded and represented, indexical and iconic 
sounds. 

One could describe the trajectory this way: The underwater realm starts out as 
silent, becomes soundful (eerie with cold-war echoes), occasionally noisy, and, 
once so revealed, turns out to be full of music from creatures imagined cióse to 
humans in cognitive power, creatures whose songs can then be separated from their 
médium — field — for contemplation. This trajectory is accompanied by another, in 
which humans experiment with their own subaqueous music to see whether they 
can move from the iconic and symbolic mode of Ravel and Debussy, from the 
octopoid onomatopoeia of Pierre Henry, to a more indexical evocation of “actual” 
immersion, or soaking. This other trajectory for underwater music is less natural 
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historical than “experimental.” Having learned lessons from the oceanographic 
“field,” musicians tinker in lablike settings, spaces where they control boundaries 
and variables. The primary such space is the swimming pool, and one of the first to 
dive in is John Cage. 


The Pool of Experiment: Cage and 

Neuhaus 


Cage — son of an engineer who worked with hydrophones — first brought water 
sound/noise into his modernist composition 

in his collaboration with Lou Harrison, Double Music (1941), in which Cage 
specified the use of a “water gong (small — 12 ”-i 6” diameter — Chinese gong 
raised or lowered into tub of water during production of tone).” . . . Cage . . . 
traces his use of the water gong to 1937 at UCLA, where, acting as an accompanist, 
he sought a solution to the problem of providing musical cues to water ballet 
swimmers when their heads were under water. (Kahn 1999, 249-50; see also Hiñes 
1994 , 90) 

Cage’s approach later became more experimental, mixing subjective and scien- 
tific methods. Water was important to Cage because, as he put it, it “prepared me 
for the renunciation of intention and the use of chance operations” (quoted in 
Kahn 1999, 249-50). In such renunciations, the authorial self-dissolved — as in 
Cage’s tale of his time in an anechoic chamber, in which he heard his own blood 
flow as part of the environment. Water stood as a Symbol of gentle noise, of a scien- 
tific modernist sublime into which an individual might dissipate, a view literalized 
in the self-experiments of dolphin researcher John Lilly, who reported auditory 
hallucinations while floating in isolation tanks. 

In some measure, this immersive symbolism is read out of how human hearing 
works under water. Sound waves travel four times faster in water than air — making 
it nearly impossible for humans to use underwater acoustic vibration to lócate 
themselves in space. This difficulty is compounded by the fact that human ear- 
drums are too similar in density to water to provide the resistance that can inter- 
rupt many underwater vibrations so they might be translated into tympanic 
movement — sound — in the ears; many vibrations pass right through our bodies. 
For humans, underwater sound is largely registered by bones in the skull, which 
allow enough resistance — impedance — for vibrational motion to be rendered 
into resonances in the body. Moreover, conduction of sound by bone directly to 
the inner ear confounds differences in signáis received by both eárs, making it 
impossible to compose a “stereo image.” Unaided human ears perceive underwater 
sound as omniphonic: coming from all directions at once (and because of sound’s 
seemingly instantaneous arrival, often as emanating from within one’s own body). 
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The underwater world is not immediately a soundscape for humans because it does 
not have the spatiality of a landscape; one might, rather, think of it as a zone of 
sonic immanence and intensity: a soundstate. lS 

Sound installation artist Max Neuhaus was next in the pool: 

In Water Whistle [1971-1974], water was forced through whistles under water to 
produce pitched sounds that could be heard by the audience only when they 
submerged themselves. In Underwater Music [1976-1978], he modified this 
technique by using specially designed underwater loudspeakers and electronically 
generated sounds, which were composed through a combination of scientific 
experiment and intuitive, Creative decisions. (Miller 2002, 26) 

Simón Miller writes, “Such compositions literally immersed Neuhaus in the 
médium — the sites were swimming pools — as he adjusted pitch perimeters and 
envelopes, in effect ‘coloring 5 the sound. They dramatized the spaces of sound, its 
limits, because the médium contained them” (2002, 26). In planning for these 
works, Neuhaus made drawings to think through “configurations of the sound 
sources in the three-dimensional space of each body of water” (see figure 6.1). 
Neuhaus’s experiment, like the use of hydrophones and sonar, took place within a 
soundscape of modernity (Thompson 2002), depending crucially on standardized 
knowledge of how sound behaves in water. 

In this work, the pool is a lab, a site with clear boundaries within which 
variables can be manipulated and in which the “nature” of water can be brought 
“indoors” to be controlled and cleaned of agents that popúlate it in “the wild” (see 
Knorr-Cetina 1999; Kohler 2002). It becomes a placeless, universalized space, 
though because of its artistic repurposing, it also becomes a space in which 
subjective experience — objectively modulated — can be realized as immersive. So, 
rather than a space in which scientists generate inscriptions on paper (Latour and 
Woolgar 1986), it becomes a space for generating impressions. In addition, as with 
the “house of experiment” described by historian Steven Shapin (1999) — a site 
where gentleman scientists could gather to agree on matters of fact — this “pool of 
experiment” admits a properly trained public of modest auditors, ready to be 
immersed. 


Conducting Transduction: Redolfi’s 

Musique Subaquatique 


The aim of immersion is elaborated in work of French composer Michel Redolfi, 
who in 1982 performed in Dartmouth College’s indoor pool. As Redolfi put it, 
“[Ljisteners of the Underwater Concerts ‘immerse themselves not only in the 
90-degree heated swimming pool, but in the sound itself” (quoted in Charles 
1993, 60). How does this immersion work? 
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Figure 6.1. Max Neuhaus’s Projection of underwater shape with sound source locations, 
Water Whistle V, 1983, ink and colored pencil on paper, 96 x 96 cm; sound work 
reference: University of California, La Jolla, 1972 (www.max-neuhaus.info/ 

drawings/waterwhistle/index_drawings.htm). 

Redolfi uses modified speakers in his underwater events. Managing the mix of 
electricity and water is key. Redolfi’s sound engineer, Daniel Harris, explains: 

Having people, the audience, and speakers with electrical connections in the 
water together can be a concern unless proper measures are taken to insure that 
no harm comes to the audience. The SWSS (Sonic Waters Safety System) was 
developed in response to our concern about the speaker systems introducing 
stray electrical currents in the water. The SWSS inserís a 50-kHz pulse wave into 
all cables entering the water in the same manner as DC power is introduced into 
microphone cables to power microphones. The return signal is passed through a 
counter/comparator circuit, which will instantly shut down any line that drops a 
settable number of pulses, usually 3, indicating a short or other malfunction. 19 
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As this list of equipment makes clear, “immersion” is a hard-won pun, a hard- 
won laboratory effect. It depends on engineering a sequence of transductions — 
translations of signáis across various media, acoustic, electronic, watery — so that 
the transduction itself is inaudible, seamless. The immersive effect would be ruined 
if Redolfi’s listeners were electrocuted. 

The symbolism of water as a sublimely immersive médium must be actively 
realized technologically. Harris writes that “The genius of Michel Redolfi is in how 
he applies his knowledge of the human acoustics of underwater sound and intímate 
familiarity with the playback Systems to the composing and mixing of his very 
original and enchanting music.” In some cases, however, that knowledge is subor- 
dinated to RedolfTs sense of how underwater music should sound. Harris writes he 
writes, of an underwater xylophone: 

Hitting a metal bar under water results in a very unsatisíying “tink,” no matter 
how hard the performer strikes the instrument, ñor how heavy the bar. [So] I 
glued piezo sensors on the bars, which, when struck, triggered samples or other 
electronic sound sources via MIDI triggers. . . . The resultant audio was mixed 
with other sources and sent to the SWSS and then to the underwater speakers. 20 

Philosopher of music Daniel Charles hears such modulations as an ideal meet- 
ing of water and music: “What is at stake, then, is not the spatial idiosyncrasy of the 
environment, but the degree of achievement of the blending of a music which can 
be described as simple, tranquil and transparent, with the physical characteristics of 
water — its density, temperature, and color” (1993, 63). Redolfi is, of course, alive to 
how Science and symbolism come together, though he doesn’t always cali attention 
to the artificiality of that relationship. In connection with a later piece, Sonic Waters , 
he writes: 

[WJater materializes sound, thickens it, and makes it palpable and penetrable. 

Water and sound, combined together at the molecular level, create a sonic and 
fluid substance that can be appreciated not only by looking at its surface reflec- 
tions, but by sinking oneself into its volume, density, warmth and vibrations 
(quoted in Charles 1993, 63). 

In addition: 

[T]he very concept of underwater sound, Redolfi says, goes all the way back to 
the songs of sirens, the bells of submerged cathedrals, the voices of lost mariners. 
“These noisy and eerie myths,” he notices, “have been swept away by the XXth 
Century and replaced by a quiet and sterile belief in the ocean as a silent world 
and occasionally disturbed by the long song of the Aqua Diva whale. But the 
fact is that the sea is a cacophony of sound, complete with fish “barking 
and croaking,” shrimp “snapping,” dolphins “whistling” and sea urchins 
“click-click-clicking.” (Charles 1993, 63). 

Note that Redolfi does not mention sonar surveys, pings, or the like; technologi- 
cal intrusions do not figure in his impression of water. Such intrusions are 
far from trivial — as the ears of cetaceans damaged by underwater sounds attest 
(James 2005). Dialing in to deployments of sound deleterious to dolphins and 
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whales might reveal a genre of underwater music no one has yet considered: 
cetacean death metal. 21 

So, while Redolfi knows that transductions are necessary to immersion, he 
ontologizes water as dreamy, alive, penetrable. Charles argües that Redolfi has done 
something radical here, however. Redolfi not only seeks to: 

reduce all metaphors ofpresence, but make ofpresence the metaphor ofitself In that 
sense, Redolfi is no more a minimalist: thanks to technology, he transforms the 
very status of presence until presence becomes, through the acceptance of its 
reproducibility, an instance which does not need any more to be interpreted or 
symbolized or displaced, because it entails its own interpretation or symboliza- 
tion or displacements. (1993, 67) 

However, if we thought not of reproducibility but of transducibility , we might hear 
more clearly the material conditions of Redolfi’s “presence,” which depends, again, 
on making sure the underwater audience is not flash-fried by subaqueous electrón- 
ica (cf. Helmreich [2007, 2009] on how transduction bolsters perceptions of 
“presence” in submersibles; Henriques [2003] on transduction in reggae; and 
Roosth [2009] on transduction in sonocytology, listening to cellular life). 

From 1981 to 1984 Redolfi scaled up his enterprise, moving into the field — the 
ocean — when Sonic Waters was performed off the coast of Southern California. 
This opening up of the dream lab of the pool into the field was accompanied by 
campy sea creaturey devices, such as the giant colorful “jellyfish” that kept a low- 
frequency speaker afloat in La Jolla Cove. Such playfulness is a reminder that Redolfi 
does not imagine crustaceans, fish, or marine mammals as audiences: “Redolfi is 
concerned with humans: ‘Every dolphin, he says, has a person nowadays to take 
care of him. ... I prefer to take care of humans 5 ” (Charles 1993, 63). RedolfFs 
approach looks similar to that of the Florida Keys underwater music festival. 
Celebrating its twenty-fifth anniversary in 2009, the festival offers to scuba divers 
music played over Lubell Laboratory speakers attached to boats floating near the 
reef. 22 Attendees dress up as fish. 


Listening to the Sounds of Science: 
The Wet Sounds Festival 


In 2008, announcements of the United Kingdom’s first underwater sound festival, 
“Wet Sounds, 55 asked people to dip into municipal swimming pools to listen to 
music specially composed by industrial, electrónica, and noise performers, as 
well as sound artists (Blanning 2008). Sound art — a genre of art that creates 
sound objects to be experienced in galleries, in public spaces, and via headphones — 
made a good match with this festival, which treated the pool not just as a 
chill-out room but also as a gallery space/lab (on sound art, see Labelle 2006; 
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Licht 2007). Listeners’ attention was persistently drawn to technoscien tifie frames 
of reference. 

Wet Sounds pieces are archived online. A few are hydrophonic recordings — so 
that the fact that they were played back under water raises the question of whether 
these are compound or redundant underwater pieces (and what happens when we 
listen in air?). The attempt to superimpose one underwater space (the ocean) on 
another (the pool) makes particularly explicit the múltiple meanings of “médium” 
in this practice of schizophonia — the splitting of sound from its source. 23 Slavek 
Kwi’s “Sonafon,” for example, is “a structural and textural exploration of echoloca- 
tion sounds made by Pink Dolphins recorded in Rio Jaupeperi in Amazonas, Brazil, 
using ultrasound range hydrophones.” Klaus Osterwaldt’s “Donatus Subaqua” 
offers “Recordings made in a quiet lake in the forest using hydrophones placed two 
meters below the surface. There are sounds of the underwater environment like gas 
bubbling up from the bottom, plants producing oxygen, inseets and even the calis 
of waterboatman.” Amie Slavin’s “Wave Play” is “an abstract interweaving of pulse, 
saw, and triangular waves designed to invoke the playfulness and the latent power 
of waves, in both sound and water.” 24 In addition, Disinformation’s “Ghost Shells” 
plays back sferics and whistlers, sounds produced by storm disturbances in Earth’s 
electromagnetosphere (and captured on very low-frequency radio bands): “These 
phenomena are referred to as ‘hydrodynamic 5 because the math used for modelling 
their behaviour has been extrapolated from observations of how equivalent wave 
phenomena behave in the variety of fluid media” (see Motschmann, Sauer, and 
Baumgaertel 1984). 25 By 2008, then, scien tifie representation — as a warrant for 
underwater realism, as an aestheticized device for delivering other worlds, as a fetish 
for formatting serious art — was sharply in presence as a passage point for making 
underwater music and sound art. 


Queering the Mermaid: Snapper, 
Oleson, Leber, Chesworth 


In May 2009 singer Juliana Snapper premiered an underwater opera titled “You 
Who Will Emerge from the Flood” (written with composer Andrew Infanti) at the 
Victoria Baths, a historie public swimming pool in Manchester, England. Snapper 
took on the challenge of singing under water, fusing technique and aesthetic: 
“Maximizing bone conduction and controlling bubble output as part of a new vocal 
fabric,” her website reports, “Snapper merges extended techniques with Baroque 
tropes that represent human longing and passion as aspeets of weather. Pre-recorded 
sounds from oceanic bubble fields and birdcalls throb above the water as Snapper’s 
voice (amplified by an underwater microphone) presses through the soundscape.” 26 
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There is a virtuoso, alien effect to be won from doing something under water that 
requires breathing. 

As with the work of Neuhaus, Redolfi, and Wet Sounds participants, this is 
music that requires research . About the field. About the body. Snapper writes as fol- 
io ws: 

I have researched underwater acoustics by reading, Consulting at the Scripps 
Institute of Oceanography, and spending hours submerged in my bathtub and 
borrowed pools. My experimentation up to now has allowed me to control my 
voice for long stretches under water, negotiate changes in depth and pressure, 
and to invent a new expressive vocal language that I cali mouth-to-water 
singing. . . . Vocalizing in water involves working with air pressure shifts that 
compress the air in the lungs and effect the sound of the phonation at varying 
depths. Mouth-to-water singing relies heavily on bone conduction to transmit 
voice, and that vibration is also the basis for sensing/hearing sounds in the 
water. The interference of bubble sounds in the breath-born phonation begets 
lightly percussive rhythmic textures, and it may be possible to control the 
pitch material and rate of bubble noise through breath pressure and buccal 
aperture so that as I sing I am also releasing a secondary melody in duet with 
my vocal cords. 

Snapper imagines herself a modern day mermaid — though one schooled in 
critical feminism. Snapper, who performed at the “Queer Up North” festival in the 
UK, does not stage herself as a deliquescent delicacy for a heterosexual masculine 
imagination, a woman merging with a uterine médium, a móbius mother/lover in 
the way that, as Douglas Kahn has argued, “submerged women” have persistently 
been posed, particularly in surrealist represen tation (see Redolffs website, full of 
pictures of bikini-ciad women floating in pools). 27 Snapper modulates water as 
feminized other into a critical substance, one that can detour the way water and 
waves have been symbolized as feminized flux (see Rodgers forthcoming). She is an 
active rather than a passive part of the médium — the aim of her work is thus dis- 
tinct from Cage’s, which sought to elimínate intentionality. She is interested, too, in 
the pool as a simultaneously public and intímate space in which sex and sexuality 
have historically been subject to disturbance — and also strictly ordered, policed, 
and ranked (she reports that, historically, the water used on “women’s days” in the 
Victoria Baths was recycled from “men’s days”!); what better place to play with 
gender and sexuality? 

Snapperís critique of canonical mermaid models becomes crisp in work in 
prívate spaces. In “Aquaoperas,” Snapper joins Jeanine Oleson to visit home bath- 
rooms and perform minioperas, with Snapper in the tub and Oleson singing into 
the toilet (see figure 6.2), their voices fusing through a “snorkelabra.” Snapper and 
Oleson’s performance of what they cali the “hot lez flower duet” from the opera 
Lakmé by Léo Delibes combines a lesbian sublime with a heterodox dabbling 
with the abject space of the toilet. In this “pool of experiment,” this soak opera, the 
relation of water to public/private, feminine/feminist/queer is up for grabs. 
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Figure 6.2. Photo of Juliana Snapper and Jeanine Oleson performing Aquaopera #2 

(SF/Lakme Redux at Shotwell Shack, San Francisco). 


Snapper and Oleson’s work is kin to the sinister work of Australian sound art- 
ists Sonia Leber and David Chesworth, who in The Gordon Assumption (2004) offer 
a recording installed in the subterranean toilets of Gordon Reserve at Parliament 
Station, Melbourne: 

An incessant outpouring of female voices lures passers-by down the stairwell to 
the cave-like subterranean toilets. At the lower gates, they are confronted with 
an asynchronous chorus of female voices in infinitely rising pitch. The voices 
gather and thicken without respite, in upwards glissandi, constantly trailing 
upwards. . . . The voices recall the mythologies and mysteries of voices heard in 
caves, where the voices of spirits, sibyls and oracles are believed to announce 
predictions and warnings from the mouth of a cave. (Leber and Chesworth 2004, 
quoted in Kouvaras 2009, 101) 

Linda Kouvaras reads The Gordon Assumption as a feminist critique of mythological 
siren songs; the songs float indeterminately above signification, yes, but are sited 
in an abject lócale. The piece recalls Duchamp’s urinal. Pools, baths, and toilets 
become laboratories for rethinking water, gender, and sexuality. 

But this laboratory research longs to go into the field. Snapper writes, “I am 
now fully prepared to realize opera in the ocean depths.” Why? Snapper writes, 
“I am interested in the rub of cultural metaphors in which water represents a dan- 
gerous zone of ‘puré’ emotion and now-urgent specters of drought and drowned 
cities.” Though she doesn’t mention it, Snapper’s words summon memories of 
Hurricane Katrina (recalled in the image of an <£ underwater jazz funeral” used in a 
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review of a 2008 play staged by Tulane Environmental Law Society students in 
New Orleans). Snapper suggests that “the material tensión between water and air 
(breath, foam, thirst) . . . speaks to an unknown ecological future.” Ecology is of 
signal interest in new underwater music. This is another arena in which Science and 
sentiment, substance and Symbol join. 


Return to the Sounded Sea: Winderen 


Tracking back from lab to field, we find Norwegian field recordist Jana Winderen. 
Winderen uses hydrophonic recordings “sourced from beneath the oceans sur- 
rounding Norway, Greenland, and Iceland” (White Line [UK] ): “The music here is 
assembled from various auditory documents gathered from research trips, all 
treated as improvisational material, and morphed into elabórate sound collages” 
(Boomkat [UK] ). Heated (2008) is a recording of warming water. Like other under- 
water composers, Winderen is interested in technologies of underwater audition. 
HeatecC s liner notes report the use of the following: “2 x 8011 DPA hydrophones, 
2 x DolphinEAR/PRO hydrophones and 2 x 4060 DPA microphones.” Winderen 
is artist and empirical researcher both. 

What is all this water music in search of? Again: immersion, though now in the 
sea — though a sea accessed by treating it as a lab, a recording studio. One reviewer 
writes as follows of Winderen: “With my headphones on and my eyes closed it 
sounds as if you are really in the middle of this water” (Earlabs [Netherlands]). 
However, the possibility of imagining oneself immersed depends, press rewind, on 
transduction. It depends on a cyborg sensibility — one indexed in David Toop’s 
Ocean of Sound, which he concludes with this cybernetically inflected contention: 
“Music — fluid, quick, ethereal, outreaching, time-based, erotic and mathematical, 
immersive and intangible, rational and unconscious, ambient and solid — has 
anticipated the aether talk of the information ocean” (1995, 280). 

However, if immersion is a submersion of self in water, water is controllable 
only in swimming pools. The “field” is different, full of other critters. Winderen, in 
“The Noisiest Guys on the Planet,” uses snapping shrimp to shift us to a multispe- 
cies soundscape. 28 Unlike Redolfi, who ignores underwater nonhumans, Winderen 
calis us back to these creatures, sentries now in climate change. The field, Winderen’s 
work suggests, is becoming more like a lab not only for artists like herself but also 
for humanity, experimenting on its ecosystem. 

What Winderen is creating, then, is not just music but — in the idiom of sound 
art — documents as well, one reason she and others find technological access to and 
scientific models of the sea compelling. If earlier generations of composers sought 
simply to replícate a submarine sublime, today’s sound artists hope not just to soak 
in sound but also to broker ear-opening accounts of human relations with the water 
around us. 
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Dunn’s Chaos and the Emergent 

MlND OF THE POND 


Skip back to David Dunn’s 1992 Chaos and the Emergent Mind ofthe Pond , a collage 
of recordings of aquatic insects in ponds in North America and Africa. This col- 
lage — fusing recordings from different ecologies — stages a different construal of 
underwater worlds, one that hears a percolating intelligence in water. In Insectopedia , 
Hugh Raffles writes of Dunn’s practice: 

Listening to the pond with two omnidirectional ceramic hydrophones and a 
portable DAT recorder, he hears a rhythmic complexity altogether greater than 
that in most human music . . . The sounds canT be arbitrary, he decides. These 
animáis are not simply following their instincts. “The musician in me cannot help 
but hear more.” . . . He begins to hear the pond as a kind of superorganism, a 
transcendent social “mind” created from the autonomous interaction of all the 
life within it, terms not dissimilar to those used by complexity theorists to 
describe the nest colonies of the eusocial insects . . . [His] soundscape is more 
than a recording, more even than a composition. It is also a research method, one 
that flows easily from a principie of wholeness. (2010, 323-24) 

The piece is inspired by anthropologist Gregory Bateson’s model of mind as a phe- 
nomenon present in worldly relations, not locked in people’s heads. Dunn’s insect 
recordings posit water not simply as a médium but also as organically enlivened, 
cogitating: “[W]hen I see a pond, I think of the water’s surface as a membrane 
enclosing something deep in thought” (quoted in Ingram 2006, 129). Unlike Cage, 
who would have advocated listening to these sounds “in themselves,” Dunn wants 
to preserve sounds’ referentiality, their link to empirical ecological processes 
(Ingram 2006) — as on his recordings of bugs in piñón trees, in which he tracks how 
their sounds flag global warming. While there exists a risk of romanticizing bal- 
anced nature, Dunn’s Batesonian approach tunes into how water may not only 
contain life but also be constituted through living things. There can be no purely 
lablike pool of water; water is made of vitality — which gets us to field recordings of 
the vital signs of ocean Earth. 


From the Cold War to Global 
Warming: Under Arctic Seas 


Andrea Polli’s Sonic Antárctica CD consists of “recordings of the Antarctic sound- 
scape made during the author’s seven-week National Science Foundation residency 
in Antárctica during the 2007/2008 season.” Polli mixes field recordings of melting 
glaciers with audio translations of scientific data (“sonifications” [Supper, this volume] ) 
on climate and peppers these documents with snippets from interviews with 
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climate scientists. The sound of ice melting and of data about ice melting are signs 
of global warming. This aesthetic production has a scientific-political point. 29 

Charles Stankievech’s “DEW project” has a scientific-political point as well. As 
Stankievech explains, a As much ideological deterrent as defense infrastructure, the 
Distant Early Warning (DEW) Line constructed between 1954-56 [near the Arctic 
Circle] was a joint venture between the US Air Forcé and the Royal Canadian Air 
Forcé. A long-distance radar and communication system, the DEW Line created an 
electromagnetic boundary able to detect airborne invasión.” 30 Stankievech revisits 
questions of territorial sovereignty, listening not up for enemy others but down for 
global humanity’s ecological depredations: “The radio station [at Stankievech’s 
Yukon river site] monitors the sounds of the river’s ice and underwater flow on a 
continual basis, transmitting the signáis to Dawson City, where the field-recordings 
are processed and broadcast via the internet.” 31 April 6, 2009, saw Stankievech 
deliver “a sound performance using live samples from the river installation, electro- 
magnetic microphones, radios and Computer.” Submarine sounds such as these 
melt the distinction between music and data collection. 

However, even data collection is available for aesthetic contemplation. The 
PerenniAL Acoustic Observatory in the Antarctic Ocean (PALAOA) transmits 
sound from the Antarctic Ocean. In addition, PALAOA (Hawaiian for “whale”) 
detects marine mammals and provides a research baseline for relatively quiet 
underwater environments. 32 The PALAOA MP3 audio stream at the Alfred- 
Wegener-Institut für Polar- und Meeresforschung website anticipates listeners, 
warning that its sound is “not optimized for easy listening, but for scientific 
research. . . . [Bjeware of sudden extremely loud events.” 33 The sound travels from 
hydrophones to a research station and then to Germany, where it is put online. In 
addition, PALAOA’s hydrophones, hanging just below the hundred-meter-thick 
Ekstróm ice shelf (near Neumayer Station, a Germán research center), are placed 
well above the SOFAR (Sonar Fixing and Ranging) channel, a layer of seawater in 
which the speed of sound reaches its underwater mínimum. Low-frequency vibra- 
tions can travel long distances through this conduit (about 800 to 1,000 meters deep 
at midlatitudes and higher toward the surface in températe zones) before they dis- 
sipate. 34 Sounds from this región are essential for ocean acoustic tomography, the 
study of ocean temperature using sound (Munk, Worcester, and Wunsch 1995). 
Submarine sounds in art and Science now echo concerns not about the Coid War 
but about global warming. Rather than evoking, invoking, or soaking, they broker 
connections between the ocean understood as natural field and considered, for 
better or worse, as a lab for global ecopolitical futures. 


CONCLUSIONS 


Summing up the are of underwater music over the last half-century: There 
have been two primary venues for underwater music: field settings of the ocean 
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(with rivers, ponds, and lakes less frequently used) and lab settings of swimming 
pools. For the field tradition, underwater music emerges from the noise of the Coid 
War, which reveáis the songs of whales. Those songs then become submerged within 
the worked-over subaqueous soundscapes of modern human enterprise — sound- 
scapes that harbor evidence of global warming, of sea creatures under stress, evi- 
dence that becomes source material for composers who mine scientific idioms for 
artistic and political statements. For the lab tradition, the pool begins as a stage to 
realize the ascetic aesthetic of Cagean modernism. It then becomes a space to play 
with meanings of water — either, as with Redolfi, to reinforce canonical symbolisms 
of dreamy, meditative, womblike space or, as with Snapper, to queer such imagery, 
to experiment with gender, sexuality, and public and private. Flowever, while the 
field setting is “wild” and entangled with nonhuman sounds and the lab setting is 
more social, cultural, or anthropocentric, artists working in both settings seek to 
evoke an “immersive” experience. Moreover, in both settings the transductive prop- 
erties of water must be managed in order to invoke water as a material accomplice 
in this enterprise, this aim of soaking listeners in the sublime surround of sound 
submerged. 


NOTES 


1 This is not a history of underwater sound, which historians of oceanography and 
acoustics have already delivered. Hóhler (2002) describes a shift from sounding the sea 
with metal ropes to sounding with reflected sound, a practice that turned the ocean into a 
three-dimensional volume ( sound as fathoming has moorings in the Oíd English sund , 
“sea,” whereas sound as vibration reaches back to Oíd English swinn , “melody.”)- 

2 Kahn calis such evocation “programmatic, depicted, or discursive water” (1999, 245). 

3 Wagner linked water and music: “If rhythm and melody are the shores on which 
music touches and fertilizes the two continents of the arts that share its origin, then sound 
is its liquid, innate element; but the immeasurable extent of this liquid is the sea of 
harmony” (quoted in Kahn 1999, 246). This chapter leaves aside evocations of sea crea- 
tures, such as Camille Saint-Saéns’s Aquarium movement in his 1886 Le Carnaval des 
AnimauXy or Edward MacDowelFs 1898 Nautilus in Sea Pieces (see Shick 2007). 

4 Toru Takemitsu’s I Hear the Water Dreaming (1987) evokes water in its Debussy- 
styled sound and also formally, through an E-flat - E - A motif that — spelled in Germán as 
Es-E-A — transliterates the word sea. 

5 See Chen (2008, 6) on “trémolos to describe the shimmering effects of light on a still 
water surface; ascending and descending arpeggiated figures and glissandi to depict the 
undulating of waves.” 

6 Cousteau’s show “The Undersea World of Jacques Cousteau” was soundtracked by 
Walter Scharf, who sought evocative music for animal subjects. For a special on sea 
elephants, “The music had a wonderful reedy quality, the same as the sea elephants, but 
with added feeling” (quoted in Shaheen 1987, 96). Cousteau’s editors made modifications: 
“The sounds of the manatees in The Forgotten Mermaids were too abrasive so the music 
editor rearranged several notes to help soften the manatee munches” (Shaheen 1987, 99). 

7 See www.filmscoremonthly.com/board/posts.cfm?threadID=59i57&forumID=i&arc 
hive=o. Sound recordist Darren Blondin (2007) writes: “Films have taught us that 
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underwater sounds [are] muffled, echoing, and bubbly. In actuality water is alive with high 
frequencies, but a bright sounding recording tends to come off as less realistic.” 

8 Though an earlier — mechanical — precedent is Italian futurist Luigi Russolo’s 
“gurgler” (“gorgogliatore”), an intonarumori (noisemaker) described in his 1913 Art of 
Noises manifestó, which called for nonrepresentational sounds — splashing, sirens — in 
composition. More satirically, Spike Jones’s 1948 “William Tell Overture” features 
gargling. In experimental composition, Francisco Lopez’s 1993 Azoic Zone , with pieces such 
as “A Vibrational Trip From Bathyal to Hadal Zones,” sounds sourced from hydrophonic 
recordings but is merely timbrally evocative of them. 

9 The indie band Yo La Tengo , resoundtracking Painlevé, retains burbly emphases. 

10 Http://en.wikipedia.org/wiki/Surf_music. “Wetness” is now taken as such an 
obvious term that even a sober book on architectural acoustics contains Índex entries such 
as “reverberation, excess creates aural soup” and “reverberation, soup and mud” (Blesser 
and Salter 2007, 428). See Doyle 2005. 

11 Listen to Parliament’s 1978 “Aquaboogie” for more drowning sounds from 
the black Atlantic. Compare Sun Ra’s visión of a black utopia in outer space 
(Williams 2001). 

12 On FreeSound, a database for sound effects, an MP3 titled “under_alien_ocean” is 
described as “beneath alien waves, its [sic] liquid, but it ain’t water ... oh and its [sic] from 
1950.” www.freesound.org/samplesViewSingle.php?id=i426o. 

13 Henry Cowell “used ‘8 Rice Bowls’ tuned to no definite pitch using water for 
Ostinato Pianissimo (For Percussion Band) (1934)” (Kahn 1999, 248). South India’s 
Jalatarangam (“water waves”) is a carnatic instrument that uses bowls filled with water. 
Steven Feld (1991) describes flowing water in songs about the Kaluli of Papua New Guinea. 
Satie wrote satirically on the move from evocative to invocative: “The hydrographic 
engineers tell us that all the waterfalls of the earth, whatever their social standing might be, 
yield a low F, clearly audible, upon which it so happens is built a perfect chord in C Major. 

. . . The Water Company is elated: it is going to install carefully calibrated conduits in all 
the concert halls to offer musicians an entire chromatic scale of little cascades” (quoted in 
Kahn 1999, 247). Satie would not have been surprised when “Cage composed . . . Water 
Walk: For Solo Televisión Performer (1959). . . . The water-related instructions and proper- 
ties . . . inelude a bathtub of water, an operating pressure cooker, a supply of ice cubes, a 
garden sprinkling can, a soda siphon” (Kahn 1999, 250). 

14 “Aldous Huxley . . . in ‘Water Music’ (1920), anticipated the importance of 
dripping water in Fluxus and, later, in chaos theory: ‘Drip drop, drip drap drep drop. So it 
goes on, this water melody forever without an end. Inconclusive, inconsequent, formless, 
it is always on the point of deviating into sense and form’ ” (Kahn 1999, 252). 

15 See Dunn and Young (2008), an anthology of drip music. Christian Marclay’s 1990 
Bottled Water is an installation consisting of bottles filled with tape containing the sound 
of dripping water. The “hydraulophone” uses streams of water in a flutelike apparatus: 
http://hackedgadgets.com/2007/02/17/ontario-science-centre-hydraulophone-musical- 
keyboard-water-fountain/. 

16 See also Colladon (1893) on 1826 experiments at Lake Geneva. 

17 Consult www.hnsa.org/sound/, an archive of navy training sounds. 

18 Differences between fresh water and seawater are consequential, too. 

19 Http://danielharrismusic.com/Underwater_Music_I.html. 

20 Compare Alan Silvestres soundtrack for TheAbyss (1989), which uses “electronic 
pinging and underwater clanging effects” (www.filmtracks.com/titles/abyss.html). 

21 Compare the use of whale songs to frighten seáis away from Lincolnshire river fish 
stocks: “To drive the seal family back into the North Sea, the National Rivers Authority 
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[sic] have been playing recordings of killer-whale songs under the surface of the Glen.” 

The Times , Oct. 31, 1994, quoted in Toop 1995, 3. 

22 Listen also to Erik DeLuca’s “The Deep Seascape: The Sonic Sea,” which “explores 
the underwater soundscape of South Florida” (www.erikdeluca.com/). 

23 So dimensional has the underwater world become that artist Bill Fontana imagines 
the transplant of a watery soundscape — as in his 1994 lie Sonore , transposing sound from 
the beaches of Normandy to the Are de Triomphe in Paris, making the “white noise of the 
sea” (Madsen 1999, 29) surround a trafile island. 

24 “There has been a long-standing association of water and sound in observational 
acoustics from antiquity through Chaucer to Helmholtz and beyond, with the sound of a 
stone hitting water producing a visual counterpart, which was then mapped back onto the 
invisible movements of sound waves” (Kahn 1999, 246; Rodgers, 2009). The correspondence 
between water and sound waves is at the heart of “cymatics” (Jenny 1967, 1974; Lauterwasser 
2006), which employs sound vibration to generate patterns in watery substrates. 

25 The 2009 Wet Sounds festival promised “deep listening” (a term coined by Pauline 
Oliveros in 1991, which referred less to water than to a mode of attention) (see Alian 2009). 
Sound artists continué to organize festivals of underwater audio. “Hydrophonia,” a festival 
of hydrophone sound art dedicated to raising public awareness of ocean noise” has been 
held in 2009 and 2010 in Italy and Spain. Refer to http://hydrophonia.com/ 

26 Quoted on http://maryrachel.wordpress.com/2009/06/19/underwater-microphones- 
and-juliana-snapper/ 

27 And in tourist sites such as Florida’s Weeki Wachee mermaid attraction, though 
women there only pretend to sing in the giant fishtank in which they perform. One wonders 
whether in íuture they will use Stachowski’s (1999) apparatus for talking under water. 

28 See Stocker (2002/2003). Sheila Patek studies lobster and shrimp sound and 
hearing in Biology @ Berkeley: http://www.bio.umass.edu/biology/pateklab/home. 

29 Compare DJ Spooky/Paul D. Miller’s Sinfonía Antárctica , which uses field record- 
ings to “capture the acoustic qualities of Antarctic ice forms, [which] reflect a changing 
and even vanishing environment under duress” (http://djspooky.com/art/terra_nova.php). 
Miller’s piece is an homage to Ralph Vaughn Williams’s 1952 Sinfonía Antárctica , which 
evoked the austral landscape with orchestral arrangements. Miller suggests that his 
Sinfonía will bypass “metaphor” and “go to Antárctica and record the sound of the 
continent.” Compare Peter Cusack’s Baikal Ice (Spring 2003), which contains hydrophonic 
recordings of the springtime sound of ice thawing at Siberia’s Lake Baikal. 

30 Www.stankievech.net/projects/DEW (accessed 9 September 2009) 

31 “A remóte transmission station housed in a geodesic dome on the Yukon and 
Klondike rivers continually records and transmits the sounds of the rivers flowing and the 
ice shifting using hydrophones embedded in and under the ice.” 

32 Www.awi.de/en/ research/ new_technologies/ marine_observing_systems/ocean_ 
acoustics/palaoa/. Christine James’s 2003 “Sonar Technology and Shifts in Environmental 
Ethics” charts the rise of noninvasive sonar in concert with environmental movements. 

33 w.awi.de/en/ research/ new_technologies/marine_observing_systems/ocean_acous- 
tics / palaoa / palaoa_livestream/ . 

34 Garry Kilworth’s Science fiction story “White Noise” tells of a haunted undersea 
cable station. Characters find that the deep retains noises from ages past because (in a 
fabulous piece of SF logic), “ c Cold, dense water is less likely to disperse or be infiltrated by 
warm currents. The circular currents weave their way intact around the ocean floor like 
blind worms.’ ‘And they retain sound patterns . . .’ ‘Like magnetic tape’ ”(1990, 513). The 
main characters, listening in on a microphone attached to a deep-sea cable beneath the 
Red Sea, hear Moses leading his people out of Egypt. 
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CHAPTER 7 


A GRAY BOX: THE 
PHONOGRAPH IN 
LABORATORY 
EXPERIMENTS 
AND FIELDWORK, 

1900-1920 


JULIA KURSELL 


Introduction 


In 1878, the year after Thomas Alva Edison submitted the first patent for his phono- 
graph, two British physicists, Fleeming Jenking and J. Alfred Ewing, published a 
report in the scientific journal Nature of a recent experiment they had conducted. 
This experiment used Edison’s device to address the nature of spoken language: 

Let a set of vowel sounds, as A E I O U (pronounced in Italian fashion), be 
spoken to the phonograph in any pitch and with the barrel of the instrument 
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turned at a definite rate. Then let the phonograph be made to speak them, 
first at the same rate, and then at a much higher or lower speed. The pitch is, 
of course, altered, but the vowel sounds retain their quality. (Jenkin and Ewing 
1878, 384) 

As Jenkin and Ewing noticed, the faster or slower speed of the phonograph altered 
the pitch of the vowels, making them appear to be spoken by a higher or lower 
voice. Yet, they found the quality of the voice remained the same: They could still 
recognize the recorded vowels. This experiment was soon contested. Charles Cross, 
working in Boston, challenged these findings with his own experiments using one 
of the first prototypes of the phonograph built by Edison. Cross (1878), in a reply 
that was also published in Nature , asserted that a change in speed actually distorted 
the vowels to the extent of making them unintelligible. For Jenkin and Ewing, how- 
ever, their experiment had a second purpose, one that is easily overlooked. Most of 
their report was devoted to the reconstruction of “the instrument.” Unlike Cross, 
they did not have access to the instruments Edison had built himself. For Jenkin 
and Ewing it was therefore a major concern whether they would manage to con- 
struct the phonograph at all based on the inventor’s description. They struggled 
with every aspect of the complicated apparatus, such as finding a foil that would 
allow the recorded sound to be replayed more than once and making a spring 
that would not only hold the speed of the barrel constant but also allow for con- 
trolled deviation. Such concerns are part and parcel of laboratory work and the 
well-known difficulty in replicating pioneering experiments (Collins 1985). Their 
article also demonstrates that they understood the phonograph to be a scientific 
instrument whose technical functions had yet to be defined. The mass production 
of phonographs, either for the laboratory or for commercial purposes, had not yet 
occurred. 

Some twenty years later, a group of experimental psychologists, based at the 
Berlin Institute of Psychology, started to intégrate the phonograph into their scien- 
tific work on a systematic basis. In 1900 Cari Stumpf, founder of the institute and 
its director until 1921, recorded a Siamese court theater group on twenty wax cylin- 
ders. These recordings laid the foundation for what was to become the Berlin 
Phonogram Archive. In 1908, when the existence of this archive was officially 
announced, the collection contained more than one thousand ítems — the number 
of phonographic rolls would eventually grow to thirty thousand. 

In 1901 Stumpf extensively discussed his first recordings in the institute’s jour- 
nal, Beitrage zurAkustik und Musikwissenschaft. Summing up, he touched upon the 
issue of methodology. In order to preserve samples of music that were as yet 
unknown to researchers and were in danger of being lost forever, a double strategy 
was recommended. Traditional notation, though necessary for discussion within 
the scientific community, was not sufficient but had to be backed up by phono- 
graphic recordings (Bruynincks, this volume). However, Stumpf was well aware of 
the difficulties posed by phonographic recordings. Among the major problems he 
listed was the recording speed: “Most importantly, one has to provide for constant 
rotation speed . . . The difficulty lies in . . . avoiding the troubles caused by arbitrary 
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diminishing of the rotation speed, as this will produce sounds that are too low and 
tend to blur” (Stumpf 1901, 135-6). 

Twenty-five years later, in 1926, Stumpf summed up his research of the past two 
decades in a treatise on the sounds of language, titled Die Sprachlaute (1926). One 
chapter was devoted to phonographic experiments with vowels. The initial ques- 
tion that had driven Jenkin and Ewing to reconstruct the phonograph had since 
been solved, Stumpf declared. It was now agreed that reproducing recorded vowels 
at a different speeds would alter their sound. This was because the characteristic 
sound of the vowels resulted from resonances that depended on the shape of the 
mouth cavity. The pitch of these so-called formants was fixed. A change in the rota- 
tion speed would thus alter the pitch of the recorded resonance frequencies and 
therefore distort the vowels' characteristics. 

Stumpf s very detailed experiments, however, did more than just provide yet 
another verification of this already accepted theory. As his report demonstrates, 
even by 1926, when wire recording had started to replace it as a recording device, 
many aspects of the use of the phonograph were still in need of careful study. The 
phonograph thus appears as a site of intersection for technology, experimental 
practices, and ways of hearing and listening, none of which were stable. Rather, they 
continually changed in the course of the experimental work, and the means of 
recording and the means of listening were constantly adjusted in relation to each 
other. Practices of fieldwork involving phonographs, tests in the laboratory, and the 
organization of storage and access in the phonogram archive were finely tuned to 
each other. For their experiments, psychologists developed technologies for the 
production of múltiple copies and standards for identifying the rotation speed. 
They tested the degree of variation in recording quality, and they eventually discov- 
ered the interrelatedness of ways of recording and ways of listening. 

Among the thirty thousand wax cylinders preserved in the Berlín Phonogram 
Archive today, with musical recordings from all over the world, there are also several 
Ítems made for experimental use. These “experimental cylinders” ( Experimentalwalzen ), 
as they are classified in the lists of the archive (Ziegler 2006), are the focus of this 
chapter. 1 They show that the phonograph was not just a new apparatus ready at 
hand. Rather, experimental work was needed to create uses of the phonograph, to 
gauge its producís (the wax cylinders), and ultimately to enable recurrence and 
comparison as the basic operations of the phonogram archive. Tracing the history 
of the phonograph means tracing the “becoming” of a médium. 

Eight of the experimental wax cylinders were used to study the tone color of 
vowels. Taking this series of experiments as an example, my chapter addresses the 
following: First, I show that a médium is not a given entity but rather an unstable 
and heterogeneous object. In the 1980S, Germán media studies borrowed Michel 
Foucault’s notion of the “historical a priori ” (Foucault 1969) to analyze discourses 
of media culture. Most prominently, Friedrich Kittler transformed “historical 
a priori ” into “technological a priori' and discussed how “discourse networks” 
grow from new technologies (Kittler 1990, 1999). By showing how media produce 
inscriptions, such analyses investigated the way in which technologies selected and 
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shaped the utterances possible in a network by forcing communication into specific 
formats. 

To be sure, sound recording, as made possible by the phonograph, can be said 
to have created such a discursive network. On the one hand, the phonograph imme- 
diately suggested new ways of asking questions even though none of its material 
aspects were yet fixed. On the other hand, some level of technical reliability and 
established modes of use made it possible to study the phonograph itself, turning it 
from a technical object into an object of investigation (Rheinberger 1997). There is 
thus an inherent ambiguity in the use of a médium, seen in the process of its emer- 
gence. While in the beginning, the phonograph was immediately accepted as a sci- 
entific instrument, the commercial use soon dominated, turning the phonograph 
into something completely different. Scientists, however, did not stop using the 
phonograph but rather integrated its new functions into their own work. This 
chapter traces this ambiguity, looking at the emergence of the Phonogram Archive 
in Berlin. More specifically, I show that the researchers had to adjust, calibrate, 
and standardize the new functions again before integrating them into the scientific 
context. 

I thus propose a view of the complex history of sound as it moves through the 
laboratory. Science and technology studies (STS) offer the framing for a history of 
media that can accommodate the idea of change and allow for the analysis of the 
phonograph as a site of intersections between its various uses. 


Instruments 


In 1878 Edison’s phonograph entered the scientific community. Experimenters such 
as Jenkin, Ewing, and Cross were by no means unprepared to accept this new instru- 
ment. “Self-recording instruments” had had a role in the laboratories of the life 
Sciences since the 1840S, when the “kymograph” was invented; sound recording, 
though without a rendition of the graphically recorded pressure wave, soon became 
one of the kymograph’s specific uses (Sterne & Akiyama, this volume). In his first 
report on this new device, Cari Ludwig included a sample of the resulting curves, 
showing traces of two bodily functions. The registration of blood pressure was par- 
alleled by the registration of lung pressure for the same individual. Each of these 
movements had been traced onto the blackened surface of a turning cylinder at the 
same time. This simultaneous registration of two curves opened up the possibility 
of correlating body processes more generally, an effort numerous laboratory 
researchers engaged in during the second half of the nineteenth century. 

It is easy to overlook the fact that what made this instrument so important was 
the correlation rather than the simple fact of recording. The success of the new 
recording method lay in its promise of mathematically formalizing the body’s 
processes instead of merely visualizing them (de Chadarevian 1993). In later 
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developments of the kymograph, this correlation became standard, although it was 
featured less and less on the visible surface of the graphic tracings. Soon after the 
kymograph was introduced, tuning forks were integrated into the apparatus 
(Jackson 2006 and this volume). Their regular vibration, registered in parallel with 
the observed body functions, served as an indicator for the speed of the turning 
cylinder. The shape of the tuning fork’s curves enabled the observer to detect any 
irregularities occurring in the rotating movement. This new arrangement enabled 
the experimenters to register only one bodily function and then to correlate it with 
the time indicated by the fork. As the fork vibrated at, for instance, one hundred 
times per second, its curve related the observed body process indirectly to chrono- 
metric time. At this stage, the two elements necessary for correlation were still pres- 
ent in the shape of two registered curves. The function of reference was, in later 
kymographs, displaced by the supposed regularity of the cylinder’s movement. The 
function of correlating eventually thus became part of the technical device. The 
tuning fork, which at first seemed to replace the registration of a second bodily 
function, turned out, with hindsight, to have been an externalized control device. 
After the tuning fork, its curve, and its soft humming disappeared from the assem- 
blage, the function of control persisted but was internalized in the apparatus. 
The cylinder of the kymograph was now understood to represent a system of coor- 
dinates that allowed the curves to be read directly (i.e., as a visualization of the 
movement), which did not necessarily require mathematical calculation. 

The phonograph posed a new challenge to laboratory work: Experimenters had 
to use their ears. Jenkin and Ewing carefully embedded this new task in the more 
common procedures of gauging and comparison. Their experiment on the record- 
ing speed was actually part of a larger project that consisted of two series of experi- 
ments. Their short announcement about vowel quality in 1878 was followed by 
several longer reports on extensive studies of vowel traces, in which the phono- 
graph was seemingly understood as yet another device for producing visible traces. 
However, the traces were more difficult to obtain, as the authors explain: 

The experiments were made as follows: — The vowel under consideration was 
spoken or sung at a given pitch, determined by a piano, while the barrel of the 
phonograph was turned at a definite speed, regulated by means of a metronome. 

The indentations made in the tin-foil were then mechanically transcribed, so as 
to give curves representing a magnified section of the impressions. . . . All 
transcripts were rejected if the tin-foil did not continué to give the sound clearly 
after being used to produce these curves. (Jenkin and Ewing 1878, 340) 

On first impression, hearing seems to be an addendum to a technique of visualiza- 
tion in these experiments, turning the visible recordings into authentic traces 
of sound. Like many researchers before them, Jenkin and Ewing produced curves 
that they hoped would explain the nature of sound. It seems here that the audio 
recordings functioned mainly to provide further verification of the data recorded in 
the curves. Indeed, the bulk of the experiments were dedicated to the analysis of 
curves. In this respect the two researchers continued what had been done with 
devices such as the phonautograph (Sterne and Akiyama, this volume). Similar to 
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the many varieties of the kymograph — including the labiograph, laryngograph, 
or logograph, which recorded movements in vocal sound production and in 
the air — the phonautograph turned vibrations into a curve on blackened paper 
(Panconcelli-Calzia 1994; Rieger 2009). 

The phonograph, however, shifted the focus to hearing. This was due, first, to 
technical constraints, as the tin foil was designed to carry only the traces of the 
sounds. No other (i.e., visual) inscription could be made in that same médium. 
Second, this forced the experimenters to carry out various functions of adjusting, 
calibrating, and controlling the correct operation of the phonograph with their 
ears. Hearing was required both for comparing the pitch of the recorded sound to 
some given instrument and for controlling the speed of the barrel by means of a 
metronome. Listening to the recorded sounds after they had been mechanically 
transformed into a shape was supposed to guarantee that the indentations still 
referred to the same sounds. Instead of correlating two simultaneously recorded 
processes, another mode of correlating was now developed that involved the sense 
of hearing. It successively addressed different — visual and audible — recordings 
of the same process rather than correlating simultaneous recordings of different 
processes. 

The new way of producing data — data that in fact served as a token for 
listening — remained almost imperceptible as long as the main method of correla- 
tion still involved the familiar analysis of visible curves. In the experiments on the 
distortion of vowels, however, the correlation of what was actually heard during the 
experiment carne to the fore. Perhaps the most important shift between these 
experiments and Jenkin and Ewing’s later work was the following: A change of 
speed was applied to recordings. Thus, correlation now applied to audible data 
alone. The issue at stake here was not comparison between nature and recording 
but the transformation of recorded sound itself. 


Vowels 


The phonograph enabled experimenters to answer a still-unresolved question about 
the nature of sound. Scientific investigations in sound color did not begin in earnest 
until the nineteenth century, although already in 1761 Leonard Euler critically 
remarked in one of his Letters to a Young Germán Princess that the timbre of tones 
had escaped the attention of philosophers. As he explained to his addressee, listen- 
ers easily discriminated the loudness of tones. Also, music had taught them that 
tones vary in pitch, and it is on this differentiation that musical harmony was based. 
There was, however, another property of sounds that people often experienced 
when listening to music: “Two sounds may be of equal forcé, and in accord with the 
same note of the harpsichord, and yet very different to the ear. The sound of a ilute 
is totally different from that of the French horn, though both may be in tune with 
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the same note of the harpsichord, and equally strong” (Euler 1823, vol. 2, 68). The 
human voice, “that astonishing master-piece of the Creator” (69), could produce 
this variety of sounds simply by modifying the shape of the mouth. Although 
the consonants involved more “organs” than just the mouth cavity, such as lips, 
tongue, and palate, Euler claimed it should be possible to construct a machine 
that could articúlate the sounds of language: a The thing does not seem to me 
impossible” (70). 

Subsequently, various attempts were made to construct such a device. 2 In 1780 
Danish physician and physicist Christian Gottlieb Kratzenstein won a prize awarded 
by the Imperial Academy of Saint Petersburg for the successful construction of 
organ pipes that could imítate the vowels of language (Ungeheuer 1983, 157). 
Wolfgang von Kempelen constructed a talking machine around 1780, an apparatus 
that emitted entire phrases in various languages. In this machine, bellows sent air 
through a variety of devices that would perform different aspects of articulation, 
such as phonation and the formation of vowels and sonorous and noisy conso- 
nants. In the final stage, the air passed through a malleable leather bell whose shape 
could be changed by hand, thus altering the quality of the “vowel” sounds (Kempelen 
1791). The phonation device in the machine could produce only one pitch — 
Kempelen did not intend his machine to sing. Nevertheless, he noticed that a 
sequence of vowels would sound like a melody. This observation, however, was a 
by-product of his work for which he did not provide an explanation. By the 1830S 
the earlier attempts to imítate speech had attracted the attention of experimen- 
tal researchers. Physicist Robert Willis conducted a series of experiments on 
vowel sounds and published his results in 1830. He criticized his forerunners for 
considering vowel sounds only with regard to articulation: 

KempeleiTs mistake, like that of every other writer on this subject, appears to lie 
in the tacit assumption, that every illustration is to be sought for in the form and 
action of the organs of speech themselves, which, however paradoxical the 
assertion may appear, can never, I contend, lead to any accurate knowledge of the 
subject. (Willis 1830, 233) 

In a long article, tc On Vowel Sounds, and on Reed-Organ Pipes,” Willis emphasized 
that the means to produce the sounds did not have to resemble the organs of 
speech. One of the experiments he described broke completely with the idea of such 
similarity. While the article is mostly about air columns in reed pipes, this one 
experiment did not involve any wind instrument. Holding a piece of watch spring 
against a revolving toothed wheel, an alternation of sound qualities was produced 
that depended on the length of the vibrating portion of the spring. As Willis 
observed: 

In effect the sound produced retains the same pitch as long as the wheel revolves 
uniformly, but puts on in succession all the vowel qualities, as the effective length 
of the spring is altered, and that with considerable distinctness, when due 
allowance is made for the harsh and disagreeable quality of the sound itself. 

(249-50) 
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Some thirty years later, Helmholtz reported on this experiment to strengthen his 
own argument that sound color is independent of a particular sound source. 
Singling out Willis’s experiment as the one in which the similarity of sound sources 
was most clearly abandoned, he remained critical: “Willis’s description of the 
motion of sound for vowels,” he commented, “is certainly not a great way from 
the truth; but it only assigns the mode in which the motion of the air ensues, 
and not the corresponding reaction which this produces in the ear” (Helmholtz 
1885, 118). 

By 1863, when his comprehensive study on hearing, On the Sensations of Tone 
as a Physiological B asis for the Theory ofMusic , appeared, Helmholtz had investí - 
gated the sounds and functions of musical and acoustical instruments, including 
the reed instruments (Jackson, this volume). In reed pipes, a stream of air that is 
regularly interrupted produces the tone. An opening that opens and closes cuts the 
airstream into parts with the help of a reed. The resulting air “puffs” will be heard 
as a tone whose pitch depends on the velocity with which the puffs follow each 
other. This family of instruments included not only some types of organ pipes and 
the pipes of the harmonium (i.e., the reed organ) but also the human vocal tract, 
the “singing voice.” 

In the voice, Helmholtz considered the vocal cords or the “membranous tongs” 
to perform the part of the reed. The sound of the voice and more specifically of 
vowels depends both on these “membranous tongs” (i.e., the vocal cords, which 
can change the velocity of their movement freely) and on the air chamber (i.e., the 
mouth cavity, which can change its shape). Both parts of the sound production can 
vary independently. The voice can sing a melody on the vowel a alone, or stay on 
the same pitch level and pronounce a , i, and o. The two varying parameters of the 
voice thus had their corollary in the mechanism of articulation. The specificity of 
the vowel sounds lay, it was assumed, in the relationship between the vocal cords 
and the air chamber. Helmholtz stated that the pitch range of the vocal cords is, in 
most cases, lower than the resonance tone of the air chamber. Therefore, the air 
chamber reinforces, he assumed, one of the partials of the sound produced by the 
vocal cords. 

Helmholtz’s further investigation of vowels as sound colors divided acousti- 
cians into two groups. In the late 1850S he had started publishing his first experi- 
ments on the sound color of vowels (Helmholtz 1859). The main impulse for taking 
up this question was a mathematical theorem originally proposed by Jean-Baptiste 
Joseph Fourier at the beginning of the nineteenth century. It claimed that any peri- 
odic wave could be formally described by its sinusoidal components. Building upon 
prior assumptions by Georg Simón Ohm, Helmholtz applied this theorem to the 
periodic waves of musical tones. He claimed that if Fourier’s theorem applied to 
these components of a tone, it predetermined the relationship of the components 
as forming integer ratios. 

The sound color of vowels served as Helmholtz’s most prominent object of 
investigation. He was even able to synthesize a number of vowels from sinusoidal 
components that he obtained from a set of amplified tuning forks. The sounds of 
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vowels fitted the experiment all too well. They built up a set of sounds that were 
easy to discrimínate even when pronounced under the most varied conditions, such 
as speaking, singing, or whispering. As it turned out, they were also comparatively 
easy to synthesize. At first glance, their sound color seemed to be determined by the 
relative strengths of the different sinusoidal components. This was in keeping with 
the analysis of musical instruments that Helmholtz carried out. Specific patterns in 
the components, such as the relative strength of every other component in the series 
of overtones in the sounds made by a clarinet, made it likely that these patterns 
would reappear in every note the instrument was able to produce. The vowels 
Helmholtz synthesized revealed a similar pattern: for an u (pronounced in the 
Germán manner), only one component was enough; an o could be produced with 
two strong components, while reinforcing the third component in the row of tuning 
forks produced a sound more similar to an a. Vowels thus seemed an exemplary 
case for a study of sound color in general. 

However, the analysis of vowels revealed something different from Helmholtz’s 
findings on the sound of musical instruments. The characteristics of vowels 
depended not just on a relative pattern of strong components but also on the abso- 
lute pitch of one or more of the stronger components in the sound spectrum. This 
had gone unnoticed as long as the production of sounds was tied to the fixed pitch 
of the tuning forks. With tuning forks a modification of pitch was almost impossi- 
ble. Yet, the two aspects of articulation rather easily explained this phenomenon, as 
the shape of the mouth did not change when a vowel was sung at different pitches. 
Therefore, the vocal cords alone produced the difference in pitch while the mouth 
alone produced the differences in sound color. Henee, Helmholtz inferred that 
vowels differed from musical instruments in that their characteristics involved an 
element of absolute pitch. For e and i, he even discovered two such elements that 
had been too high for his tuning fork synthesizer to produce. 

Jenkin and Ewing immediately responded to the invention of the phonograph 
by seeing it as the device that would shed light on the question of the color of 
sounds. They perhaps understood the new possibilities for manipulating record- 
ings that the phonograph offered before grasping the meaning of the phonograph 
as producing audible and manipulable tokens for acoustic research. They saw the 
phonograph as a laboratory instrument that would enable them to carry out the 
controlled transposition of frequencies. A change in the rotation speed would 
alter all recorded frequencies in the same way. In this respect, the phonograph 
actually resembled an existing acoustic apparatus: the siren (Jackson, this volume; 
Welsh 2008). This instrument consisted of a rotating perforated disc through which 
a stream of air was passed. When one of the openings in the disc crossed the 
airstream, the air could pass through the disc; thus, a regular series of air pulses 
emerged, which, starting at a certain frequeney, were heard as tones. The typical 
howl of the siren was due to the fact that the rotation had to be started at zero 
velocity and reach the required frequeney only gradually. Assuming that the sound 
of the siren was composed of partíais that were in accordance with Fourier’s 
theorem, researchers such as Helmholtz or Jenkin and Ewing could infer that the 




A GRAY BOX 


185 


transposition of spectra was feasible. In the howl of the siren, one could not make 
out any apparent change in color. 

With its feature of transposing entire sound spectra, the phonograph opened 
up a new possibility of testing the two explanations for sound color Helmholtz had 
offered. His explanation for the tone color of musical instruments implied that 
their sound should not be affected by transposition, while his explanation for the 
tone color of vowels implied that transposition would cause distortions. The pho- 
nograph thus suggested the experimentum crucis to decide between these two expla- 
nations. Jenkin and Ewing heard no change in tone color. Again, the immediate 
respon se, this time by Charles Cross, showed that this experiment involved more 
than just the observation of physical phenomena. The vowels form a system of 
differentiations that allows for great deviations, while maintaining the ability to 
communicate the differences. 

As it turned out, the first experimenters had to sort out what the term quality 
actually referred to when applied to the distorted vowels. Soon after they had pub- 
lished the first notice in a letter to the editor of Nature Jenkin and Ewing wrote the 
following: 

We venture, however, to remind any one trying the experiment that a low note 
folio wed by a high one suggests a change from u (Italian) to i. Thus if we whistle 
a low note and then the octave to it or a note near this, the ear is easily persuaded 
that the whistle resembles u i, but if now, beginning again on the note we just 
thought was ¿, we go up another octave, the new sequence again suggests u i, 
although the very note which was last taken to represent i now stands for u. If, 
therefore, we wish to judge what a sound really is we should not trust much to 
contrast, especially when a change of pitch is involved in the comparison. (Jenkin 
and Ewing 1878b, 167) 


Howls 


In 1914 and 1916, when Stumpf carried out his vowel experiments, his situation was 
very different. For one thing, he noted the awkwardness of his situation: In the 
laboratory, he and his assistants listened to almost imperceptible sounds while out- 
side, beyond the city limits, the war roared. This silence is documented on one of 
the cylinders, which contains a soft noise, showing the attempt to use the phono- 
graph for the investigation of whispered vowels. The analysis of the whispering 
voice had by then become a standard item in the investigation of the resonances of 
the mouth cavity. Dutch physiologist Franciscus Cornelis Donders had inferred 
from Helmholtz’s investigation that the mouth cavity would render its tone even 
without being stimulated by the vocal cords. This gave rise to the question of intel- 
ligibility, which was to be investigated in terms of communication technology 
(Schmidgen 2007). 
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In a very different sense, however, Stumpf and his team listened to the howl of 
the siren. All of a sudden it appeared as a constant change in sound quality. A new 
way of listening posed the question of the vowel sound anew, but the object of lis- 
tening was now the apparatus and its properties. The phonograph turned from a 
technical device into an object of investigation. The Berlin laboratory for experi- 
mental psychology, after more than ten years of work building the phonogram 
archive, was well equipped with a number of different phonographs. The labora- 
tory protocols reveal that some experiments were carried out with a device called a 
“parlograph,” designed for office use where it was supposed to substitute for face- 
to-face dictating . 3 The parlograph thus exemplifies a specialization in the develop- 
ment of the phonograph by focusing on one of the functions its inventors had been 
busy promoting . 4 Stumpf s interest was more in comparing the recording devices 
that were available with regard to their potential use in musicology and experimen- 
tation. The instrument used in most of the experiments, however, was a phono- 
graph that was more appropriate for the purpose of changing the rotation speed. Its 
range of possible rotation speeds was large, allowing for a triple augmentation of 
the lo west speed. Together with his assistants, Stumpf carried out two series of 
tests — eight wax cylinder rolls in all were recorded . 5 The first series consisted 
of three cylinders, one for each rotation speed: slow, normal, and fast. These cylin- 
ders clearly revealed the experimental nature of the enterprise. On one cylinder, a 
voice is clearly heard asking for the tuning forks as the next item to be recorded: 
“und jetzt die Gabeln” (“and now the forks”); a howling sound follows this speech. 
It appears that the fork had been brought too cióse to the recording funnel and had 
thereby distorted the sound. 

A list of the recorded sounds is given in the protocols, as well as in Stumpf s 
report in Die Sprachlaute. In addition to sung vowels, the cylinders recorded the 
sounds of tuning forks, organ pipes, and artificial vowels produced with a vowel 
tube (a device developed by Robert Willis in the 1830S). Thus, the recorded Ítems 
combined sounds of the voice with sounds of laboratory instruments. All of these 
sounds were carefully chosen to enable maximal control of distortion. For instance, 
on the first series of cylinders not all vowels were recorded, only the u and o were 
juxtaposed. With only two vowels the systemic effect of recognizing the whole 
sequence of vowels could be avoided — a problem that had led former researchers to 
overhear the actual distortion of the sounds. The choice of these two vowels was 
sufficient to verify Helmholtz’s assumption. Stumpf agreed with Helmholtz in that 
he took u to come closest to a sound with only one component of no fixed pitch. 
Thus, u should remain unchanged when the rate of rotation was altered. An altera- 
tion of u should in turn indicate that something was wrong in the experiment. In 
contrast, o should easily turn into an a when the speed was increased if Helmholtz’s 
hypothesis about absolute pitch characterizing vowels was correct. The change in 
speed should not only alter the vowels but also create a difference between them on 
account of their transposition. 

The next series of five cylinders focused on sung vowels, juxtaposing them 
with two spoken words and a series of tuning fork sounds. For these cylinders, 
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the complete series of Germán vowels, including ü , 0, and a, was recorded, followed 
by the words Kuckuck (cuckoo) and Uhu (owl). Five recording speeds were tested 
by replaying the recording at the same speed, as well as a number of different speeds. 
The distortions varied and pointed to different problems in recording. The tuning 
forks indicated the frequency range in which a recording could be made. Under 
certain conditions, their sounds would simply disappear. For the other sounds, 
squeaking and bleating tones were heard, as were resemblances to various musical 
instruments. In most cases, these changes could be explained by the so-called for- 
mant theory, which assumes stable amplitude peaks in the frequency spectrum. 
When using very low or very high rotation rates, the distortions were less easy to 
understand. Additional overtones appeared: Kuckuck turned into the nonsense 
syllables Tretre. Stumpf concluded that such phenomena must be explained by 
physical distortions that occurred when the phonograph was used in extreme ways. 
In other words, as any other médium, the phonograph had a linear characteristic 
for only a small región. Beyond the limits of this región, nonlinear distortions 
occurred. Using vowel sounds as objects for their experiments in recording quality, 
Stumpf and his team found two characteristics of recorded sound that were of the 
utmost significance for the use of the phonograph in ethnomusicology. The first 
was that sound color changed when the reproduction speed was altered. As long as 
the musicological investigation of the recorded sounds had not yet determined 
which of its characteristics were significant, such distortion could endanger the 
significance of the whole enterprise. Second, with the use of sung vowels and 
spoken words as the object under investigation, they also showed that an incorrect 
reproduction speed could destroy the meaning of a recorded item. 


Cylinders 


The group of researchers who took part in these experiments was remarkable in 
itself. Stumpf was assisted by, among others, Max Wertheimer and Erich Moritz 
von Hornbostel (i.e., by one of the founders of Gestalt psychology and one of the 
founders of ethnomusicology). 6 The latter had been director of the Phonogram 
Archive for more than a decade. Although this was still not an official position, 
Hornbostel had invested a considerable amount of time and money in the enter- 
prise. In his growing compendium of published work, methodology was a central 
issue. Most notably, two articles he coauthored with Otto Abraham laid the foun- 
dation for comparative musicology, both giving center stage to the phonograph: 
Über die Bedeutung des Phonographen für die vergleichende Musikwissenschaft (On 
the Meaning of the Phonograph for Comparative Musicology) (1904) and Vorschlage 
für die Transkription exotischer Melodien (Suggestions for the Transcription of 
Exotic Melodies) (1909), where transcription meant mostly transcription from 
recordings. Most of HornbosteEs subsequent publications on specific musical 
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examples were based on the collection of the archive, often pointing to this 
origin with titles such as Phonographierte türkische Melodien (Phonographed 
Turkish Melodies) and Phonographierte Indianermelodien aus British Columbio 
(Phonographed American Indian Melodies from British Columbia). 

In cooperation with Félix Luschan, the first chair of ethnology at Friedrich- 
Wilhelms University in Berlin, Hornbostel prepared instructions for travelers on 
how to use the phonograph in the field. It was integrated into a guide for ethno- 
logical travels published in the journal Zeitschrift für Ethnologie (Journal for 
Ethnology). The Phonogram Archive handed out an offprint of this guide to travel- 
ers who were willing to make recordings and bring them back to Berlin. In many 
cases, the phonographs themselves were also provided by the archive, as were tuned 
whistles to be used to indícate the rotation speed. In the article, significant consid- 
eraron was given to how the recordings in the archive might be used. In order to 
safely identify them, some preventive measures had to be taken. As any written 
information about a roll could get lost or be attached to the wrong item, the travel- 
ers were instructed to record the identifying data directly onto the wax cylinder. 
They were instructed to first blow the whistle and then briefly classify the prove - 
nance of the recording. The rotation speed could later be easily identified from the 
pitch of the whistle. This instruction added an acoustic tag to the recording, which 
became the basis for comparison. Without this tag, the recording of unknown 
sounds would be useless for the musicologists using the archive. With the help of 
the whistle, however, the traveler could safely identify the speed of the recording 
and thus guarantee that the sound color was correctly reproduced. As a conse- 
quence, the traveler could deliberately choose whether greater length or higher 
quality were more important in a given situation. The texts of epic poetry, for 
instance, would often require more time, while for music better quality seemed 
indispensable. This choice was backed up by the pitch tag, which connected the 
recording to the collection of other ítems in the archive. 

In 1908 the Phonographic Journal reported on a recording that had been made 
in the ñame of preserving the culture of a Slavic people living in the Lausitz región 
of Germany. The trip that led to the recording was organized in March 1907 by 
the author of the report, Barón von Hagen. Hagen had invited the members of the 
Phonogram Archive to join him in recording “Wendish” music. This was the 
Germán ñame for the Slavic minorities living in Germán territory. 

On behalf of Geheimrat Stumpf (the Psychological Institute had been invited), 

Dr. Hornbostel appeared with a small phonograph. However, Dr. von Hornbostel 
recorded only the spinning songs of an oíd Wendish woman because Geheimrat 
Stumpf has not yet shown interest in [recording] speech or in things already set 
to music (von Hagen 1909, 460). 

Barón von Hagen had other interests, as he declared in this and other articles 
that he and like-minded people published in the journal. He believed it should be 
the responsibility of a phonogram archive to preserve and foster Germán culture. 
This required archiving the voices of prominent people and making a survey of 
the Germán dialects and the non-German languages spoken within the territory 
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of Germany. For teaching purposes, a third section of the archive should contain 
samples of languages other than Germán. A fourth section should provide materials 
for physiology and voice therapy. The last section, he suggested, should be a music 
collection, divided in two parts, one part containing vocal samples for the teaching 
of singers and the other — corresponding to what the Berlin Phonogram Archive 
already did — with samples of music from all over the world (von Hagen 1909, 382). 
In the late nineteenth and early twentieth century, the obsession to inscribe one’s 
own culture into history also turned into an obsession with preserving one’s pres- 
ent. The phonograph offered a twist on this desire for preservation, a twist that 
projected it into the future because the phonograph created as it preserved (Sterne 
2003, 332-3; Hoffmann 2004). All o ver Europe similar activities took place, such as 
the Musée phonographique in Paris (Brain 1998, 277ff.) or the later activities of 
Germán linguist Wilhelm Doegen for the Germán Sound Archive (Lange 2007). 

Von Hagen’s somewhat eclectic program was held together by commercial 
interests. Hagen urgently advised the Phonogram Archive to get in touch with the 
emerging recording industry, which, he hoped, would support them in collecting 
samples of languages and music abroad. This link between the professional record- 
ing industry and scientific enterprises has been noted elsewhere in this handbook 
(e.g., Bruynincks, Whittington, this volume). Most important, Hagen tried to con- 
vince the Phonogram Archive to abandon the phonograph and to use gramophone 
technology. The main advantage of the gramophone was that its dises were easier to 
copy. Copying initiated the double success of the gramophone on the market, and 
soon it surpassed the phonograph. In contrast to the phonograph and its cylinders, 
the gramophone and its records thus split into two industrial branches. Selling 
sounds on gramophone dises became a business in its own right, and the sound 
quality became subject to competition and secrecy. Duplication technology, of 
course, was the technological basis for the commercial success of the dises. 
Phonographic recordings posed greater difficulties in copying. Hornbostel, a trained 
chemist, eventually succeeded in solving this problem. Electroplating allowed a 
matrix to be made of the cylindrical shape of phonograph recordings. For a while, 
the Berlin Phonogram Archive even led the technology of duplicating wax cylin- 
ders. Although this method was risky (it destroyed the original in the process of the 
galvanoplastic duplication), other collectors trusted HornbosteFs method to such 
an extent that they sent their origináis to Berlin in order to obtain “galvanos” (i.e., 
the matrices) and copies in exchange. This allowed the collection in Berlin to grow 
even faster as a copy of each electroplated cylinder was kept. For commercial pur- 
poses, however, the method was too costly. The material used to make the copies 
was found only after a long series of triáis. 7 The phonogram archive repeatedly 
appealed for funds, mostly asking the government to support the work. 8 

Barón von Hagen and an anonymous supporter ridiculed the phonogram 
approach in numerous articles published in the Phonographische Zeitschrift . 9 They 
pleaded for a strong commitment to the gramophone’s recording and duplicating 
methods as this would open up the potential for a self-sustaining basis for the archi- 
val work. Gramophone dises could be sold to schools or to individuáis interested in 
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learning foreign languages and thus would return some income to the scientific 
enterprise. 

Abraham and Hornbostel, however, opted for the technology that enabled 
users to record themselves. What was even more important to them was the possi- 
bility of manipulating recordings for research purposes. They discussed the pros 
and cons of the “phonographic technology” at hand in their 1904 árdele on the 
significance of the phonograph for comparative musicology: 

In recent times, phonographic technology has made great progress. The phono- 
graph was followed by the gramophone, and both apparatuses contest each other, 
each has its advantages and shortcomings. Both use membrane vibrations, which 
are transmitted to a pivot by a lever, which in turn writes its movements into the 
waxen surface. The phonograph does this vertically, punctuating its movements 
into the laterally moving wax cylinder. In contrast, the gramophone uses dises 
instead of cylinders; the pivot draws the undulating shape of the tonal vibrations 
onto the dises’ surface. (Abraham and Hornbostel 1904, 231) 

The exact kno wledge about the gramophone’s recording technology, Hornbostel 
and Abraham added, remained an industrial secret. For this reason the gramophone 
could not be used for prívate recordings. 10 A great advantage of the gramophone 
was the convenient storing of dises. To this end, the Viennese Phonogram Archive, 
which was established shortly before the Berlin Archive, had invented its own 
method of transcribing wax cylinders onto dises, using vertical cutting similar to 
Edison’s phonograph. This technology, however, tied the apparatus to its location. 
It was so heavy that it had to remain in the rooms of the Austrian Academy of 
Sciences, which hosted the archive. The phonograph, in contrast, was easy for one 
person to carry. In his travels to the Lausitz, Hornbostel brought only a small pho- 
nograph. However, the recordings he made were futile, not so much because of 
their allegedly inferior quality but because he was not allowed to play them. These 
recordings are preserved today in the archive, but in every container there is a note 
stating “playing forbidden.” The right to market and to listen to them belonged to 
von Hagen only. 1 1 

In the Berlin archive, the phonograph remained in place as the recording device. 
Although, back in 1900, Stumpf had been given an opportunity to try wire record- 
ing — then called “telephonography” — the archive stayed with the wax cylinders. 
Over the years, methods of identifying the cylinders were developed, the safest 
being inscription on round labels that were glued to the cylinders’ boxes. These 
labels were printed for the Psychological Institute, allowing for the inscription of 
categories such as inventory and catalogue number, as well as the tribe, object, 
person reciting and person recording, and the place and date of the recording. Most 
often, these inscriptions were repeated on the bottom of the box to minimize the 
danger of errors. These boxes were also used for the copies that the archive obtained 
from collectors who had asked for galvanos. 

For the experimental recordings, these identifying measures were only rarely 
used. A collection of song performances initiated by Otto Abraham as early as 1907 
to measure how closely performances matched given intervals is still stored in boxes 
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with archive numbers 1401 to 1416 and catalogue numbers Exp. 18 to 34. But the 
identification of many others still remains questionable. The recordings were not 
copied, so the boxes contain the original wax cylinders. In addition to damage from 
mold, the recordings are spoiled in many places by the distortions produced by 
repeated playback of specific parts, thus clearly pointing to the methods that were 
used when measuring the recorded songs. When listening to the recordings, the 
musicologists used the cylindrical shape to create loops. As Hornbostel and Abraham 
wrote in their methodological instruction, “In order to get to hear a single note in 
isolation, one has to make the membrane [i.e., the needle] touch the cylinder while 
hindering its lateral movement by switching off the guide conduct. This method, 
however, deteriorates the recording” (Abraham and Hornbostel 1909/1910, 17). 

Abraham’s recordings make this deterioration palpable. Abraham published 
his results in 1921 in an article titled a Tonometrische Untersuchungen an einem 
deutschen Volkslied” (Tonometric Study of a Germán Folksong) (Abraham 1923). 
In order to get people to sing in an automated manner rather than reflect on the 
pitches, Abraham chose to have the national anthem serve as the “folksong” in 
question. Hornbostel, who had perfect pitch, recorded the anthem several times. 
He whistled it and sung it without words, first in one key, then transposed by one 
half-tone, and then very low and very high. His performance as it is preserved today 
is interrupted many times by the traces of measuring isolated notes. The recording 
eventually gets stuck when Hornbostel sings in the highest register — the cylinder 
loops his crackling voice when he bursts into laughter at his own singing. 


CONCLUSION 


The historiography of recording has done great Service in enumerating technical 
devices: First, the phonograph was invented, shortly after that the gramophone, 
then around the turn of the century, wire recording comes in, and so forth. Not 
surprisingly, the straightforward accounts, indispensable and rich in historical and 
archival detail as they may be, have long been challenged. It has remained difficult, 
however, to focus on technology at the same time as integrating it with cultural 
history. 

An important achievement in this respect has been Friedrich Kittler’s quest for 
the technical conditions of cultural strata, which brought about a new perspective 
on technology. Given that at the time of Kittler’s first pathbreaking writings (1990, 
1999), the Computer was opening up a new reality of media use, this search for the 
technical a priori was also pertinent to a new theory of media — one that could 
describe how symbolic codes were dissolved by self-registering apparatuses only to 
be turned into symbolic code again, if on a completely different, binary level. Sound, 
for example, was transformed into curves to be then digitally encoded. For the 
problem ethnologists encountered when they transcribed the music engraved 




192 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


on wax cylinders, Computer technology developed new Solutions. A century after 
Hornbostel and Abraham wrote down their ideas, musicians and ethnologists are 
now able to use special functionalities for transposition in sound software. These 
tools use digital encoding to avoid the distortion of pitch and sound quality that 
occurs when a recording is replayed at a slower speed in the analogous mode. 

Media analysis in the wake of Kittler’s search for technological a priori thus 
set for itself two stable entities: a contemporary point of departure, such as the 
Computer and the discursive network enabled by the Computer on the one hand, 
and a history of symbolic encoding, binary calculus, the history of investigating 
temporal resolution on the other hand among the conditions that made Computer 
technology and its uses possible. Moving backward from a given State, the analysis 
of the technical a priori eventually discovered some primary conditions for this 
State. Around the turn of the twenty-first century, media studies started to 
question the historical framework of this analysis. Since then, media analysis has 
been understood to help focus on the instability of the notion of a médium. 
Focusing, for example, on a historiography of media, such studies show that most 
accounts of media technology could not work without a notion of médium that 
took its definition for granted rather than explaining it. While this had been excus- 
able with regard to the simple account of technological development, in later 
histories of recording technology, such a stable notion of médium was concealed 
as well. The history of recording and its meaning for an auditory culture often pre- 
supposed that, once the phonograph was invented, any of its later uses could have 
been anticipated. 

When viewing technology as the condition of culture one can easily get trapped 
in this presupposition. Other accounts of recording history have therefore strived 
to keep the definition of a device open to redefinition. For instance, Emily Thompson 
has urged researchers in her account of the history of the phonograph to accept that 
just after its invention “there was no single role or purpose for the invention to 
fulfill. The phonograph appeared before a need for its function had been identified” 
(Thompson 1995, 137). With this methodological caveat in mind she did indeed 
keep her approach to the history of the phonograph open, particularly with respect 
to an important element in the historiography of media — feedback, in her case 
feedback between consumers and technology. Obviously, however, the functions of 
the phonograph were not exclusively defined by consumers. One should not under- 
estimate the importance of finding the modernity of the phonograph in consump- 
tion any more than one should also stress that consumption is always bound to 
some consumable object. 

What is important in this story of the phonograph also within the context of 
Science and technology studies is that it is not simply a story of black boxing. When 
dealing with the phonograph as the object of study, one must keep in mind that a 
process of standardization covers only some of the important aspects of the phono- 
graph as a médium. A narrative of standardization has an inherent directionality: 
It tells how a function becomes fixed or, in other words, black boxed. While 
both the functions of recording and playback and the concrete instrument of the 
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phonograph became standardized to some extent during the time the phonograph 
existed, their interplay often counteracted standardization. The black box had to be 
reopened, and some light had to be shed on its functioning, thus turning it into a 
gray box. 

The story of the vowel experiments proposes a different approach to the history 
of the phonograph because it foregrounds the interaction between scientists and 
technological development. 12 This story traces the múltiple shifts from exploring 
sounds by using a specific instrument to exploring a technology by using specific 
sounds. Introducing the phonograph as a scientific instrument allows the shift from 
an object to a means of investigation to be described and vice versa. As shown, the 
phonograph was immediately accepted by acousticians as another recording device. 
Technology thus takes center stage, but it also puts the emphasis on manipulation 
rather than standardization. Aside from the history of technical development and 
the history of its uses, there is a history of the phonograph as an unstable object — a 
gray box — in the context of research. 

The story of the vowel experiments mirrors the various histories of the phono- 
graph in a peculiar way. While every experimenter who repeated this experiment 
usually pointed to the improving quality of the device, the word quality describes 
this change insufficiently. A detail in the functioning, such as the rotation speed, 
must be understood differently in different historical contexts. Varying the speed 
meant different things in each case. If for Fleming and Jenkin speed was just one 
technical element among many that they had to control, rotation speed turned out 
to be the crucial characteristic of this new device because it allowed for the manipu- 
lation of recorded sound. After 1900, the situation changed profoundly. Ethnology 
brought new uses for the phonograph. Ethnological recordings, however, were 
worthless without information about the rotating speed. Still, ethnologists, did not 
want see the speed of recording and replaying be shut away in a black box. They 
preferred to stay in charge of this technical detail. Asking travelers to measure the 
speed by recording a known note, the ethnologists similarly avoided losing control 
of the issue of standardization. With such details, the phonograph points to fea- 
tures of sound recording that would become central in further research on sound 
and hearing: recurrence and manipulation. As a technological object, the phono- 
graph quickly stabilized. Its basic elements — the needle, the barrel and spring, the 
membrane and funnel — point to functions such as engraving, transport, and ampli- 
fication. Because they can be found at any stage of the technical development of 
sound recording, they are not specific to the phonograph but rather pertain to 
sound recording in general. Also, particular uses of the device soon crystallized, 
which pro ved to be of considerable social impact. 13 On closer inspection, however, 
none of these functions and uses were as stable as they seemed. Rather, in order to 
understand what could be done with a phonograph, one had to consider what could 
not be done with it. The specificity of the phonograph did not reside in its stable 
functions and uses but rather in a specific instability. A historiography of media has 
to look at the “becoming-media” (Vogl 2007), that is, the gradual coming into being 
of media, as Joseph Vogl has suggested. 14 
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Media involve perception. However, at the same time they replace it with tech- 
nology. First, the phonograph created sounds that turned perception upon itself: In 
the context of experimental psychology, the phonograph enabled the experimenter 
to retain the identity of the investigated recording, while at the same time produc- 
ing variation in the perceived phenomena. Second, the phonograph thereby created 
points of reference: In the context of the phonogram archive, recurrence was essen- 
tial, guaranteeing the repeated accessibility and the identity of the recorded phe- 
nomenon in each act of investigation. Finally, the phonographic recording provided 
a field of possible perception — the archival collection of data that would be open 
for questions that the researchers could not yet foresee. This collection brought into 
awareness that it would remain incomplete not so much because there was always 
more to be collected but more so because there would always be new ways of deal- 
ing with the preserved material. For comparative musicology, which would later be 
renamed ethnomusicology, 15 the phonograph provided the elementary technology 
for recursion and comparison. Its use in these emerging disciplines was supported, 
however, by a sepárate strand of research that intersected with ethnomusicology in 
the Berlin laboratory. Here, functions of the phonograph were tested (Bijsterveld, 
this volume), simultaneously revealing functions of hearing. These experiments 
involved more than Science. Consumer uses, political interests, and cultural settings 
shaped the ways in which the experiments employed the phonograph, as well as the 
ways in which their results were fed back into culture. These experiments took place 
at the intersection of music psychology, ethnomusicology, experimental phonetics, 
and psychology — the phonograph serving as the point at which these disciplines 
converged. 


NOTES 


1 Ziegler (2006, 83) mentions 106 cylinders in the collection of 
“Experimentalaufnahmen” (experimental recordings). A number of digitized recordings 
from this collection are available at http://vlp.mpiwg-berlin.mpg.de/library/audio.html 
(accessed 30.09.2009). 

2 Accounts of the history of speaking machines are given in Ungeheuer (1962, 1983); 
Hankins and Silverman (1995, 179-220); and Felderer (2002). Further attempts to construct 
speaking devices are mentioned by Hankins and Silverman (e.g., by Pope Sylvester III, 
Robert Hooke, and, in the late eighteenth century, the Abbé Mical and Erasmus Darwin). 

3 Cf. Ethnologisches Museum Berlin, Preufíischer Kulturbesitz. Berliner Phonogramm- 
Archiv: Cari Stumpf Papers on Acoustics, Envelope 17, Phonographische Versuche. 

4 For the use of the phonograph, see Thompson (1993); more generally on early 
recording, see Reed and Welch (1994). 

5 Cf. http://vlp.mpiwg-berlin.mpg.de/library/data/lit38711, http://vlp.mpiwg-berlin. 
mpg.de/library/data/lit38712, http://vlp.mpiwg-berlin.mpg.de/library/data/lit38713, http :// 
vlp.mpiwg-berlin.mpg.de/library/data/lit38894, http://vlp.mpiwg-berlin.mpg.de/library/ 
data/lit38895, http://vlp.mpiwg-berlin.mpg.de/library/data/lit38896, http://vlp.mpiwg- 
berlin.mpg.de/library/data/lit38897, and http://vlp.mpiwg-berlin.mpg.de/library/data/ 
1^38898 (accessed Sept. 30, 2009;, cf. footnote 2). 
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6 On Wertheimer as a pupil of Stumpf see Ash (1995); on Hornbostel see Klotz 
(1998). 

7 The recipe for this was lost until 1989, when the Berlín collection was reassembled. 
See Ziegler (2006). 

8 Cf. the appendix in Kaiser-El-Safti (2003). 

9 Anonymous (1907, 1908); von Hagen (1908); cf. also Stumpf (1908). 

10 Like the phonograph, the gramophone was originally a recording and a replaying 
apparatus. Cf. Moore (1999). 

11 Cf. Susanne Ziegler’s comment in the booklet to Music! 100 Recordings : 100 Years of 
the Berlín Phonogramm-Archiv 1900-2000 (Wergo LC 06356 2000), 125-27. 

12 In this respect, my account of the phonograph comes cióse to the accounts of 
interaction between instruments and their users given in Joerges and Shinn (2001). 

13 The use of the phonograph has been critically reviewed by Jonathan Sterne (2003) 
under the heading of U A Resonant Tomb.” Here, Sterne shows how preserving sound for 
eternity became a leitmotiv in the discourse on the phonograph. 

14 Vogl (2001, 2008) and more generally the corresponding issue of the Archivfür 
Mediengeschichte with a focus on media historiography, ed. Lorenz Jáger, Bernhard Siegert, 
and Joseph Vogl, as well as the issue on “New Germán Media Studies” in Grey Room 

(29) (Winter 2008), ed. Eva Horn. 

15 The discussion about the naming continued until the 1950S; cf., for example, 
Kolinski (1957). 
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Introduction 


This chapter analyzes the ways in which nineteenth-century acoustical instruments 
that were meant to standardize musical performance and measure various dimen- 
sions of sound, such as pitch and beat, were a century later put to use as musical 
instruments themselves. Metronomes (and their predecessor, the chronometer) 
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and tuning forks migrated from mechanicians’ workshops to bourgeois house- 
holds, rehearsal halls, and physics and physiology laboratories and then back to 
concert halls, where they were the primary instruments of a number of twentieth- 
century compositions. Similarly, sirens, another instrument employed by nine- 
teenth-century acousticians for accurately determining musical pitch, were heard 
with increasing frequency in the twentieth-century music halls of New York, Berlin, 
and París. Drawing upon a material cultural history of Science and technology, this 
chapter traces the ways in which these objects were redefined by their new roles as 
the generators rather than the quantifiers of musical qualities by exploring both the 
use of mechanical apparatus to standardize critical aspects of early nineteenth- 
century music and the resulting debates surrounding what such standardization 
meant to the art. Did these machines hinder or enhance expression and Creative 
genius? Could they thwart the attempts of virtuosi to take liberties with the com- 
poser’s original intentions? Twentieth-century composers such as Gyórgi Ligeti, 
Edgard Várese, and Warren Burt used these same acoustical instruments to subvert 
the very notions they were created to define and reinforce. 

Standardization, the theme that ties together the various portions of this chap- 
ter, has been a critical component of Science and technology studies (STS) and 
recent works in the history of Science and technology (Frángsmyr, Heilbron, and 
Rider 1990; Olesko 1996; Schaffer 1992, 23-56; 1995, 135-72; Alder 1995; Gooday 2004; 
Galison 2003, 84-155). This chapter focuses on a different type of standardization, 
one of aesthetic qualities relevant to music, such as pitch and beat. Paradoxically, 
the physical instruments used to standardize music resulted in a plethora of pitches 
and tempi, thereby vastly increasing the richness of musical expression. Finally, this 
chapter addresses a topic often ignored by historians and sociologists of Science 
and technology, namely the importance of sound in eighteenth- and nineteenth- 
century workshops. Most historians of Science and technology, as well as STS schol- 
ars, have been guilty of fetishizing the visual at the expense of the aural. In so doing, 
they have missed an opportunity to investígate the importance of sound in early 
acoustical laboratories. 


Nineteenth-Century Physical 

Instruments 


The experimental history of all three instruments described in this chapter origi- 
nated in nineteenth-century workshops. The tuning fork, invented in 1711 by 
John Shore, sergeant trumpeter to England’s royal court, did not become a scien- 
tific instrument until the early nineteenth century. In 1802 Germán acoustician 
E. F. F. Chladni explained the principies of a tuning fork based on the longitudinal 
vibrations of a straight rod, which is bent in the middle. The tuning fork fits squarely 
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into Chladni’s research on vibrations and nodal points with a view to applying this 
knowledge to his invention of two musical instruments, the euphone and the clavi- 
cylinder. He also noted that the volume of a sounding tuning fork alternatively 
increased and decreased as one rotated it 360 o (Ullmann 1996, 102). 

In 1825 Ernst Heinrich Weber, Leipzig professor of anatomy, and his younger 
brother Wilhelm Eduard Weber, Leipzig professor of physics, turned to the tuning 
fork to explore the property of transverse vibrations, or ones that vibrate perpen- 
dicular to the direction of the propagating sound wave. They, like Chladni, were 
interested in the relationship of the volume of a tuning fork and the direction of the 
propagation of sound waves. They also employed the tuning fork for their investi- 
gation of condensed and rarified sound waves (Weber and Weber 1825, 507, 513). 
Their preliminary research on transverse vibrations led to Wilhelm Weber’s subse- 
quent research during the 1820S and early 1830S on the construction of compensated 
organ reed pipes, whereby the transverse vibrations of a vibrating reed compen- 
sated for the longitudinal vibrations of the air column, resulting in a constant pitch 
regardless of the air pressure blowing through the organ pipe (Jackson 2006, 114-34). 
In short, the Weber brothers and Chladni used tuning forks as physical instruments 
to study sound-wave propagation, interference, and musical-instrument design. 

Shortly after Wilhelm Weber completed his work on acoustics, J. Heinrich 
Scheibler, Krefeld’s leading silk manufacturer and opera aficionado, was working 
on his invention of the tonometer, which would revolutionize the tuning of pianos 
in equal temperament (Scheibler 1834, 53). His reputation began to spread through- 
out the Germán scientific community for the precisión of his tonometer, an appa- 
ratus that comprised fifty-four tuning forks, each of which pulsated at 4 vibrations 
per second (vps; later Hz) sharper than the previous one. It encompassed the range 
of one octave, from 220 vps to 440 vps and served as the model for future nine- 
teenth-century tonometers, including those of Georg A. I. Appun and Rudolph 
Koenig. He was also renowned among scientific (and, to a much lesser extent, musi- 
cal) circles for his attempt to standardize performance pitch. 

From 1840 onward, vibrations were often depicted graphically. Jean-Marie 
Constant Duhamel was one of the first physicists to realize that a Stylus or pin, with 
ink at its tip, when attached to a tuning fork, can record vibrations, which the pin 
sketches onto a piece of paper wrapped around a resolving drum. The drum makes 
one complete revolution in a precise, known period. The tuning fork is struck, and 
the pin makes sinusoidal curves on the paper. One counts the number of curves and 
determines the frequency of the tuning fork (Morton 2004, 2). This is the basic idea 
behind the kymograph, originally invented by Cari Ludwig in 1847 to monitor blood 
pressure (Kursell, this volume). 

In 1855 Jules Antoine Lissajous, French mathematician, experimental physicist, 
and professor of physics at the Lycée Saint-Louis, invented a method to visualize 
acoustic vibrations with the aid of tuning forks. He reflected a light beam from a 
small mirror attached to a vibrating tuning fork and then from a larger, rapidly 
rotating mirror onto a screen. After further study, he was able to generate his 
“Lissajous figures” by reflecting a beam of light from mirrors perched on top of two 
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vibrating tuning forks positioned perpendicular to each other. These curves are 
determined by the relative frequency, phase, and amplitude of the tuning forks 5 
vibrations as depicted on a screen (Jackson 2006, 210). 

Throughout the i86os French physicist Victor Regnault collaborated with the 
Prussian-born, Parisian instrument maker Rudolph Koenig on measuring the 
velocity of sound in the Parisian sewers. Regnault invented his own versión of 
Koenig's tuning-fork chronograph, which registered the arrival of a sound pulse 
with an electric signal at the beginning and end of the pulse. The tuning-fork chro- 
nograph could graphically depict with unprecedented accuracy the travel time of 
the sound wave (Pantalony 2009, 49). 

Tuning forks were at the center of a contro versy between Koenig and the doyen 
of Germán physics, Hermann von Helmholtz (Pantalony 2005, 57-82). Throughout 
the 1870S and i88os Helmholtz and Koenig engaged in a lively exchange about a new 
class of combination tones, or tones that are produced when two pitches are 
sounded simultaneously. Helmholtz called these new class of tones summation tones 
since they sounded at a frequency equivalent to the sum of the two primary pitches 
(Pantalony 2005, 62). These summation tones were much more difficult to hear 
than the other types of combination tones. Indeed, their existence was extremely 
controversial as they could not be heard by the unaided, untrained ear. Nonetheless, 
Helmholtz argued for their existence based on measurements obtained by resona- 
tors and membranes (Pantalony 2005, 62). Koenig, using his tonometer, carne up 
with a different rule for the generation of combination tones. Drawing upon the 
older “beat theory 55 originally proposed by Thomas Young in the early nineteenth 
century, Koenig argued that these tones were a rapid succession of beats that 
blended as a tone. Beat tones were his answer to Helmholtz’s combination tones. 
Koenig predicted and detected beat tones that were neither mathematically pre- 
dicted ñor observed by Helmholtz (see also Kursell 2009). In short, the tuning forks 
of the late nineteenth century became scientific instruments used to measure pitch 
precisely and describe the results of acoustic interference. 

Tuning forks, however, were not limited to the experiments of physicists; they 
also found their way into the work of nineteenth-century anatomists and physiolo- 
gists dedicated to elucidating the processes involved in aural perception. In 1827 
Charles Wheatstone, a London experimental philosopher and later professor of 
experimental physics at King 5 s College, discussed the effects of a tuning fork on the 
ear. The sound of a tuning fork whose base is placed on the forehead when the ears 
are blocked is louder than when the ears are open. If one ear is open and the other 
closed, the sound will be louder in the closed ear. If a tuning fork is placed above the 
temporal bone adjacent to a closed ear, the sound seems to be diverted to the oppo- 
site ear. Wheatstone concluded that since the volume of sounds communicated to 
the ear canal is increased, so too should the volume of external sounds acting on a 
cavity in a similar fashion (Wheatstone 1879, 30-32). “The great intensity with which 
sound is transmitted by solid rods, at the same time that its diffusion is prevented, 
affords a ready means of effecting this purpose, and of constructing an instrument 
which, from its rendering audible the weakest sounds, may with propriety be named 
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a microphone” (Wheatstone 1879, 32). In short, he discovered the effects of 
occlusion and lateralization. 

In 1834 Ernst Heinrich Weber described the same phenomenon based on the 
very same technique without referencing Wheatstone (E. H. Weber 1834, 2,5—44; 
Feldmann 1997, 321-22). A vibrating tuning fork is placed in the middle of the fore- 
head equidistant from a patient’s ears. The patient is asked to report in which ear 
the sound is louder. The sound should be heard equally loudly in both ears. If not, 
then the patient is either suffering from one-sided conductive hearing loss or uni- 
lateral sensorineural hearing loss due to an improperly functioning cranial nerve 
VIII. 1 This technique is now referred to as the Weber test. Eduard Schmalz, a 
Dresden physician and one of Weber’s students, was responsible for introducing 
the Weber test to the field of otology for diagnosing various aural disorders 
(Feldmann 1997, 322-23). In 1853 Heinrich Adolf Rinne, a Gottingen physician, 
developed a similar hearing test, which now bears his ñame. It is often used in con- 
junction with the Weber test for detecting sensorineural hearing loss. The stem of a 
vibrating tuning fork is placed on the mastoid bone. Patients then inform the physi- 
cian when they can no longer hear the tone. The physician notes the time and then 
immediately places the vibrating prongs of the fork next to the ear. A patient with a 
normally functioning cranial nerve VIII will hear the tuning fork for a longer period 
of time when placed next to the ear than when it is resting on the mastoid bone. The 
procedure is then repeated with the other ear 2 (Feldmann 1997, 324-325). 

A second instrument used by nineteenth-century acousticians to measure pitch 
and to understand the phenomenon of acoustical interference was the siren. In the 
1820S it was the preferred scientific instrument for precise pitch measurement in 
France. Whereas tuning forks sound due to the vibrations of the metal prongs in 
the air, the siren sounds as a result of rapid pulses of air. Since sirens produce sounds 
in ways distinct from rods and strings, they provided physicists and physiologists an 
opportunity to reconceptualize and understand the notion of tone. By the 1840S 
natural scientists were convinced that the sensation of tone was predicated upon 
periodic pulses transmitted to the auditory nerve. Henee, tone was now associated 
with periodicity rather than with the form of the vibration (Vogel 1994, 263). 

Originally invented by Edinburgh professor of physics John Robinson during 
the late eighteenth century and subsequently improved in design by French engi- 
neer Charles Cagniard de la Tour in 1819 (Robinson 1822, 403-405; de la Tour 1819, 
167-71; Robel 1891, 7-14), the siren is a thin disc, usually made of tin, with a series of 
concentric holes drilled around its circumference with a stationary pipe covering 
one of the holes. As the disc revolves around its axis, air travels through the pipe 
and also through the holes when they are directly over the mouth of the pipe, pro- 
ducing puffs of air. If one knows the number of revolutions of the wheel and the 
number of holes, the product of the two will be the number of puffs per second. De 
la Tour’s device of 1819 was the most accurate and commonly used siren of the early 
nineteenth century. 3 As Helmholtz was to argüe in his seminal Die Lehre von den 
Tonempfindungen of 1863, “this number is consequently far easier to determine 
exactly than in any other musical instrument, and sirens are accordingly extremely 
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well adapted for studying all changes in musical tones resulting from the alterations 
and ratios of the pitch numbers” (Helmholtz 1954, 12). 

In the late 1820S and early 1830S sirens were used in numerous physical and 
physiological investigations. The Parisian acoustician Félix Savart modified de la 
Tour’s siren. His new device, known as the Savart wheel, comprised toothed metal 
dises. The dises spin and rub against a card at the rim of the wheel. The frequeney 
of the pitch is directly proportional to the rate of the rotation of the wheel. He used 
this device to test the lower range of human hearing (Savart 1830, 337-52; 1831, 
69-74). During that same period, de la Tour turned to his siren to test the entire 
range of human hearing. He also employed the instrument to calcúlate wind speed, 
as the pitch of the siren was proportional to the velocity of the wind entering 
through the holes (Robel 1891, 16-17). In 1829 Pierre Louis Dulong used a de la 
Tour siren to confirm Pierre Simón Laplace’s equation for the speed of sound 
through air. 

In 1841 August Seebeck’s polyphonic siren, a siren that has several circles con- 
taining a different number of holes producing different pitches, was the instrument 
of choice to investigate phase relationship and interference generated by two or 
three tones. Historically, this is one of the most important and best- known applica- 
tions of the siren to test physical theories. Seebeck was interested in determining 
how the ear perceives nonisochronic pulses. In situations where pulses occur in 
successive intervals of t, t’, t, t\ he was able to distinguish two tones, one corre- 
sponding to t + t’, the other representing the octave (t + t’)/2. The relative intensity 
of the two tones depended upon the difference between the two intervals. He con- 
cluded from his siren experiments that a tone was “a nearly isochronic series of 
single pulses of arbitrary form” (Vogel 1994, 263; Seebeck 1841, 417-36). Two years 
later Seebeck’s conclusions were challenged by Georg Simón Ohm, who asserted 
that any sound could be expressed by a simple harmonic term in a Fourier series 
(Ohm 1843, 513-65; Jungnickel and McCormmach 1986, vol. 1, 268; Vogel 1994, 264). 
Seebeck responded by arguing for the importance of empirically researched sound 
perception, maintaining (contra Ohm) that the definition of a tone should not be 
limited to those that can be expressed as simple harmonic terms. Seebeck employed 
the polyphonic siren to produce tones for which the intensity of the vibrations cor- 
responding to the upper partíais could either be calculated mathematically or be 
distinctly heard 4 (Helmholtz 1895, 59). 

Metronomes are the third example of physical instruments used to standardize 
musical qualities, in this instance, beats. The chronometer was invented in 1656 by 
the Dutch natural philosopher Christiaan Huygens and patented the ensuing year. 
He drew upon the earlier work of Galileo Galilei, who had realized during the early 
seventeenth century that pendulums could be used as effective timekeepers. Galileo 
had determined experimentally that the period of any given pendulum is indepen- 
dent of the weight of the plumb bob and the amplitude and that the square of the 
period varíes directly with the length of the pendulum. Clearly, seconds pendu- 
lums, or those with a period of two seconds (one second per direction) were critical 
to eighteenth- and nineteenth-century astronomy and navigation. The metronome, 
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which in essence is a modified chronometer, employed a compound pendulum 
pivoting around a central point with a fixed weight at the bottom and an adjustable 
weight at the top (Levin 1993, 83). It was introduced as a scientific instrument in the 
1830S, when Scheibler used it while constructing his tonometer. As mentioned ear- 
lier, Scheibler’s tonometer, which comprised fifty-four tuning forks made by the 
Krefeld instrument maker Hermann Kámmerling, encompassed the octave from 
220 to 440 vps. With a modified metronome made by Kámmerling, Scheibler now 
possessed a precise measurement of beats produced by adjacent forks. Kámmerling 
also created a tuning fork for the tonic, its octave, and each semitone in between 
(Jackson 2006, 162-63). 

Much like tuning forks, metronomes were also used in physiological investiga- 
tions of the mid-nineteenth century. The Dutch physiologist Franciscus Cornelius 
Donders employed metronomes, tuning forks, and a kymograph in his research on 
physiological time. The subject sat at a table with an induction coil and several elec- 
tric switches. Electrodes were placed on the subject’s foot at the anklebone and on 
the skin of the groin. The subject operated the switches in response to light electric 
shocks generated by the investigator. A metronome was used to determine the reac- 
tion time (Schmidgen 2005, 225). Donders also employed a metronome to measure 
the rhythm of heartbeats (Schmidgen 2005, 229). In short, much like the tuning 
fork, the metronome was an instrument central to nineteenth-century physical and 
physiological research. The borders between music, physics, and physiology were 
rather porous. 


The Standardization of 
Nineteenth-Century Musical Practice 


Tuning forks and metronomes were clearly not restricted to nineteenth-century 
acoustical workshops and laboratories; they were also critical to musical practice. 
During the late eighteenth century, chaos apparently reigned in performance 
pitches. In 1791 flautist J. G. Tromlitz had remarked that “the pitch of all places is 
not the same but sometimes varíes up to a semitone higher or lower” (translated in 
Powell 1791 and cited in Haynes 2002, 313). The lowest Viennese tuning fork had still 
been a semitone sharper than the Leipzig pitch (Haspels 1987, 122; Haynes 2002, 
338). Not only did different countries possess different pitches for a\ but it was a 
rarity to find orchestras even in the same city sharing the same pitch. For example, 
in París, the pitch used by the Grand Opera for a ranged between 427 in 1811 to 434 
in 1829, while the Italian Opera preferred 424 in 1823 (Ellis in Helmholtz 1954, 
495-500). The pitch used by the Dresden opera began to rise, albeit rather slowly, in 
1821 and reached 435 some time between 1825 and 1830, while the organ of the Román 
Catholic Church in Dresden sounded at 415 in 1824 (Ellis 1880, 310, 318). 
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In addition to the myriad of concert pitches throughout Europe, a gradual 
sharpening of the various concert pitches was well under way by the 1820S. As early 
as 1802 Heinrich Koch had noted in his seminal Musikalisches Lexikon that 
Kammertóne were gradually rising (Koch 1802, 822; Haynes 2002, 312). Generally, 
with the formation of large orchestras with a greater variety of instruments and 
musical scores written to accommodate such orchestras, tonal color catered to the 
higher instrumental range rather than the human voice, or the vox humana , as had 
been the case during the Middle Ages (Ellis 1880, 309-10). The rapid rise in pitch 
was perhaps greatest in the Germán territories. Whereas Kammertóne throughout 
Prussia and Saxony had been consistently lower than those in Vienna and París 
during the early years of the nineteenth century, by the 1830S Germán pitches were 
slightly higher than in most French cities and Vienna (Haynes 2002, 349). According 
to the Romantic music critic G. L. P. Sievers, by 1817 “the pitch of the three great 
Parisian orchestras is more than a semitone higher than the highest in Germany 
and Italy. The purely instrumental groups, where no singing is involved . . . [use a] 
tone [that] is even higher” ( Allgemeine musikalische Zeitung [henceforth AmZ] 19 
(1817), 302; Haynes 2002, 330). In the early 1830S, however, J. Heinrich Scheibler 
took several tuning forks from Berlín, Dresden, and Paris, and demonstrated 
that the tuning fork from the French capital possessed the lowest pitch. The pitches 
of these forks varied as much as three-quarters of a full tone. In the Allgemeine 
musikalische Zeitung a concerned reader lamented the “excessively high pitch” of a 
number of Germán orchestras (AmZ 37 (1835), 206; Haynes 2002, 349). In September 
of 1834 Heinrich Scheibler presented his work on the determination of the pitch for 
a at 440 vps (or Hz). After conducting several experiments with various pitches 
used in Paris, Berlín, and Vienna, Scheibler decided to choose his a at 440 as 
the middle of the extremes between which the pitch of Viennese pianos rises 
and falls due to change in temperature. The pitches of these pianos were deter- 
mined by a monochord, and the pitch 440 vps was checked by his tonometer. This 
precise valué was approved as the official, national Germán pitch by the physics 
section of the Versammlung deutscher Naturforscher und Aerzte in Stuttgart in 
September 1834. 

Variations in pitch did not merely alter the aesthetic effect of pieces but also 
had an anatomical consequence. Vocalists in particular were distressed by the lack 
of pitch standardization. Throughout the first four decades of the nineteenth cen- 
tury, th e Allgemeine musikalische Zeitung often reported the various diatribes against 
the rising pitch (AmZ 4 [1801], 76; 5 [1803], 529-35; 16 [1814], 772-76; 31 [1829], 
285-94; 37 [1835] , 205-207). Sopranos in particular protested vehemently: The prima 
donnas carried along tuning forks to performances and insisted (with limited suc- 
cess) that host orchestras tune to them (AmZ 3 [1801], 76; Haynes 2002, 333). In 1814 
the philosopher C. F. Michaelis, professor of the philosophy of aesthetics at Leipzig, 
and J. G. Schicht, cantor of the Thomasschule in Leipzig, had pleaded for a common 
pitch for singers and orchestras throughout Europe. They argued that the necessity 
of singers to adjust to orchestral pitches, which differed as much as a full tone, was 
both exhausting and deleterious to their health (Michaelis and Schicht 1814, 772-76). 
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The Parisian prima donna Alexandrine Caroline Branchu forced the opera to lower 
its pitch in the early 1820S as she feared the high pitch would result in a premature 
loss of her voice (Haynes 2002, 330). Arias that required the mastery of very high 
pitches, as well as pieces that contained falsettos, were particularly difficult as the 
singers touring Europe were forced to readjust their ears and vocal cords to the 
particular a that the orchestra chose. The problem became particularly acute in 
the mid-nineteenth century, when vocalists and instrumental virtuosi toured 
Europe at a feverish rate. Standardization was both an aesthetic and a physiological 
necessity. 

Tuning forks were the instrument of choice when Europeans decided to stan- 
dardize concert pitch. In 1859 a French commission that comprised composers, 
including Héctor Berlioz, Gioacchino Rossini, and Giacomo Meyerbeer, and phys- 
icists, including Jules Lissajous, professor of physics at the Lycée Saint-Louis, and 
César-Mansuéte Despretz, sought a French concert pitch. During his research, 
Lissajous investigated the pitches of numerous tuning forks that musicians through- 
out Europe had sent him. He employed his technique of using Lissajous figures 
(discussed earlier) to determine the pitch of each fork precisely and accurately. 
When the French commission decided on a standard pitch of 435 Hz, it had the 
diapasón normal constructed that sounded at that pitch (Jackson 2006, 207-14). 
Similarly, when representatives from six European nations traveled to Vienna in 
November of 1883 to determine a European pitch, two physicists, Vienna’s profes- 
sor of physics Josef Stefan and Rome’s professor of experimental physics Pietro 
Blaserna, informed the international gathering of the necessity of tuning forks 
for keeping a steady pitch. Their pitches, unlike those of organ pipes, were not 
greatly influenced by changes in temperature. Henee, the tuning fork remained the 
physical instrument of choice to convey the proper pitch to European musicians. 
National scientific and technological bureaus, such as the Physikalisch-technische 
Reichsanstalt, were required to produce tuning forks that sounded at 435 Hz 
(Jackson 2006, 218-29). 

What in the seventeenth century had started as isolated calis for mechanical 
instruments to more accurately count the beats of a composition culminated in a 
deafening crescendo by the end of the eighteenth century. In response, musicians 
and mechanicians attempted to construct portable chronometers. Magdeburg 
organist and cantor G. E. Stóckel argued that chronometers were necessary for 
beginners interested in learning to play music, as well as for those Liebhaber (ama- 
teurs) who do not play professionally but simply for their enjoyment (Stóckel 1800, 
658). In addition, Stóckel underscored the importance of a correct, standard mea- 
sure of time for compositions: “[T]he advantage of an as yet unknown, absent, 
generally accepted standard for musical time, or a chronometer, according to which 
this time is precisely determined . . . will be clearly recognizable” (Stóckel 1800, 
658). The chronometer was, for Stóckel, the ally of the composer, whose “spirit” 
would be preserved (Stóckel 1800, 660). He also hoped that chronometrical markings 
might ensure a beat based on the universal standard of time, replacing the constant 
referral to vague terms such as andante , allegro , and presto (Stóckel 1800, 677). 




210 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


He argued that the chronometer was just the device to assist pupils, a who by nature 
do not at first possess a sensitive and sure feeling of the beat (Taktgefühl) to sustain 
the most exact, rhythmical practice of the pieces” (Stóckel 1800, 679). The chro- 
nometer could provide mechanically what the performer lacked organically. In 
1806 the piano instructor F. Guthmann penned an article describing his idea for a 
Taktmesser. He expressed his concerns about using a chronometer as a teaching 
tool, which students followed throughout the practicing of the entire piece, as this 
was anathema to “the spirit of true music” (Guthmann 1806, 118). He added, “ [The] 
greatest artistes have also proven that they could not play folio wing such a mechan - 
ical instrument. It repulsed their sensitivity, and they deviate from it involuntarily” 
(Guthmann 1806, 118). 

During the early nineteenth century, not coincidentally the period that 
witnessed an increase in individual interpretation, improvisation by performers, 
particularly the virtuosi, gave rise to what some would see as the mechanical 
tyranny of precisión. In the second decade of the nineteenth century, debates 
began to rage within Germán musical circles as to whether chronometers or 
timepieces should be used. Critics noted that the mathematical tempo correspond- 
ing to chronometer markings might simply be too mechanical. Precisión just might 
not be all that beneficial. By not divulging the exact tempo markings for the piece, 
the composer allowed for individual interpretation. Mechanical control was 
diametrically opposed to the art of music — a theme echoed by a number of 
performers. 

Perhaps the most informed piece on the subject was written by Gottfried 
Weber. Born in 1779, Weber had studied law at Góttingen, subsequently taking 
up a position in the imperial court of law in Wetzlar. He later became a tribunal 
judge in Mainz and remained in that position until 1818, when he was elected 
court legal advisor at Darmstadt, where he was named to a legal commission in 
1825 that drafted the civil laws of the región. His civil and legal accomplish- 
ments, however, belie his undying passion for music. An accomplished cellist 
and flautist, he learned much from his friend, the acclaimed composer Cari Maria 
von Weber. Gottfried Weber, who founded Caecilia , an important journal for 
musicians and connoisseurs of music, authored various books and essays on 
a myriad of topics, including critiques of Mozart’s compositions, a theory of tonal- 
ity, the acoustics of wind instruments, the improvement of musical-instrument 
design, and the physiological properties of the human larynx (Schilling, vol. 6 
[1838], 832-33). 

Weber set up his essay on the chronometer in the form of a dialogue between a 
composer and a music director (G. Weber 1813, 441-47). The composer wishes to 
convince the music director that chronometers and timepieces are necessary for the 
proper performance of a composition. The music director, sensing that he is losing 
his authority to interpret the piece, protests staunchly. After the music director 
argües speciously against using new instruments to determine the beat by drawing 
upon classic authorities such as Sébastien de Brossard’s Dictionnaire de musique , 
the crux of the argument ensues. The music director first condemns the use of 
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mechanical instruments that make noise while keeping the beat. The composer, 
however, wholeheartedly agrees. The music director openly disputes the use of a 
chronometer to replace his conducting duties (G. Weber 1813, 443). In a sense, 
the machine would be too perfect for the imperfect art of music and for those 
who perform it. Once the composer assuages the music director’s fear of 
replacement, he continúes the conversation: “I was never of the opinión that a 
chronometer would be suitable for directing music. No lifeless or insensitive 
machine would ever be suitable for that” (G. Weber 1813, 443). For the composer 
(who represents Weber’s own views here), the chronometer has the unique func- 
tion of serving as an interpreter between the composer and the performers. 
The composer continúes: “It is merely a standard of measure for the composer to 
be able to show the performer or the director of large musical pieces at exactly 
which tempo he wishes to have his work performed” (G. Weber 1813, 444). The 
composer simply needs to provide the music director and performers with the 
length of the pendulum, which produces the required time interval. The music 
director, by the conclusión of the dialogue, is convinced of the chronometer’s 
importance. 

The early chronometers, as ingenious as they might have been, never proved to 
be nearly as successful as Johann Nepomuk MálzeFs metronome. MálzeFs metro- 
nome, which he pirated from the renowned Dutch mechanician Diederich Nicolaus 
Winkel, possessed three advantages over the earlier chronometers. First, it was 
much smaller, more portable, and easier to use. Second, it was based on the number 
of beats per minute, a measure of time (unlike a measure of length), which every 
country recognized and henee was the best measure possible for standardizaron. 
Third, it was relatively inexpensive. Indeed, Málzel himself made two hundred met- 
ronomes and sent them free of charge to the leading musicians and composers of 
the period: a clever business tactic to ensure that his device was the only one used 
for counting beats at various tempi (Harding 1983, 27-28). 

Composers joined the ever-increasing number of supporters of MálzeFs 
mechanical contraption. They included Gaspare Spontini, Antonio Salieri, Muzio 
Clementi, Giovanni Battista Viotti, Luigi Cherubini, J. N. Hummel, Conradin 
Kreutzer, Ludwig Spohr, and Ignaz Moscheles. More important, we are told that 
“the aforementioned masters have promised to note all their future compositions 
according to the scale of MálzeFs metronome in order to confront every argument 
that can arise over the speed of the movement” (Wiener Allgemeine musikalische 
Zeitung , February 6, 1817). 

In 1817 MálzeFs metronome made its grand appearance and received strong 
approval from an anonymous author in the Allgemeine musikalische Zeitung 
who expressed the hope that MálzeFs machine would free musicians and compos- 
ers from “such multiheaded anarchy, which rules over these matters” ( AmZ 
19 [1817], 417-18). The metronome was easy to use as it did not presuppose 
any mathematical or scientific knowledge on the part of the musician, and it had a 
range of 50 to 160 pendulum swings per minute. The Quarterly Musical Magazine 
and Review of 1821 summed it up perhaps best: The metronome’s “divisions are 




212 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


thereby rendered intelligible and applicable in every country: a universal standard 
measure for musical time is thus obtained, and its correctness may be proved 
at all times by comparison with a stopwatch” ( Quarterly Musical Magazine and 
Review 3 [1821], 303). By 1824 the metronome had become part of Ufe for musicians 
and composers in Britain: “[A] metronome must be referred to in order to ascer- 
tain the movement of the adagio or the allegro” (Cutler and the Editor 1824, 
31-33). 

Beethoven’s first works with Málzel’s metronome markings were his string 
quartets op. 18, 59, 74, and 95 and the piano sonata op. 106. He then added his mark- 
ings to all the movements of his symphonies (Wehmeyer 1993, 52; Stadlen 1979, 
12-33). Beethoven did his best to forcé his will on the performers: “Maelzel’s metro- 
nome gives us an excellent opportunity to do so [i.e., ensure that the composer’s 
intensions were followed]. I give you my word for it here, in my further composi- 
tions I shall not use those terms. . . . That one will cry out ‘tyrants, 5 I do not 
doubt . . . It would still be better than us being accused of feudalism” (Hamburger, 
1951, 16; Leonhardt 1990, 146). Beethoven realized the issue was quintessentially one 
of control. He was particularly angered by the virtuosi, who constantly took liber- 
ties with the tempi of his piano pieces (Haupt 1927, 129; Schindler 1840, 212-24). 
Being infamously a rather whimsical creature, he withdrew his support once he and 
Málzel became embroiled in a lawsuit over the ownership of the “Battle of Vittoria,” 
calling the metronome a “dumb thing; one must feel the tempi” (Hamburger, 1951, 
161). This view was the major objection to using the metronome. Rhythm should 
somehow be organic; one needed to feel it from within, not copy it from an 
external source. In addition, Beethoven famously mocked the metronome in the 
second movement of his Eighth Symphony. This sentiment was echoed by 
Sievers, who wrote in September 1819 that the metronome was an “error of the 
human spirit because music cannot be impro ved by mechanical inventions” (Sievers 
1819, 399). He felt that the performer could use a different tempo; henee, the metro- 
nome was merely a triviality ( Spielerei ) (Sievers 1819, 600). He thought it strange 
that Mozart and other renowned composers were content to compose their pieces 
without fearing that the tempi of their compositions would be violated (Sievers 1819, 
600). According to Sievers, a “philosophical” point was at issue: “A piece of music 
stops being a product of its composer the moment it is performed by an artiste; 
then this artiste is its second creator or something much more: an animator of the 
work, transferring his living spirit into the dead form of the composition” (Sievers 
1819, 600). 

Sievers’s opinión notwithstanding, Málzel continued to tinker, subsequently 
impro ving the design of his metronome in 1818 and 1832 (Anón. 1836, 10). In 1836 an 
anonymous pamphlet boasted two important attributes of the device. First, it 
gave the composer control over the performance of the piece. Second, it helped 
beginners learn the correct “feeling of the beat” (Taktgefühl) (Anón. 1836, 5). 
The mechanical metronome helped discipline both the youthful beginner and the 
capricious, unbridled performer. 



FROM SCIENTIFIC INSTRUMENTS TO MUSICAL INSTRUMENTS 


213 


Twentieth-Century Musical 

Performance 


In the twentieth century, tuning forks continued to be the subject of research in 
physics and engineering. The use of quartz tuning-fork resonators is still rather 
popular in a range of scientific instruments, such as optical microscopes, atomic 
and magnetic-force microscopes, high-sensitivity magnetometers, atomic point 
contact sensors, and spectroscopic gas sensors (Barbic, Eliason, and Ranshaw 2007, 
524). Quartz tuning-fork resonators are quite small: The tiñes are about 1.5 mm in 
length and about 100 pm in both width and thickness. Such dimensions permit the 
tuning fork to be highly sensitive forcé detectors (Barbic, Eliason, and Ranshaw 
2007, 525). In the Accutron watch marketed by Bulova, Max Hetzel employed a 
360-Hz Steel tuning fork powered by a battery, thereby greatly increasing the accu- 
racy of the watch. The tiny quartz crystal used in modern quartz watches is made in 
the shape of a tuning fork. The properties of quartz cause the tuning fork to genér- 
ate a pulsed electric current, which is detected by the Computer chip in the watch to 
keep track of the time. 5 Tuning forks are also still used in physics and engineering 
courses to teach dampened oscillatory motion (Wolfson and Pasachoff 1999; Serway 
1996). They are also employed in primary hearing tests in the same manner dis- 
cussed earlier in this chapter. In addition, they can be used to test for broken bones 
in the absence of an X-ray machine. If a sounding fork placed on a bone causes 
pain, the bone has most likely suffered a fracture (see Rice, this volume). 

During the twentieth century, the tuning fork reentered concert halls, this 
time as a performance instrument rather than a mechanical device to assist one’s 
technique in private. From 1985 to 1987, the American-born Australian musician 
Warren Burt composed “Music for Tuning Forks.” Burt is known for composing in 
a wide variety of new musical styles, including acoustic music, electro-acoustic 
music, sound art installations, and text-based music. He constructed the aluminum 
tuning forks himself in the autumn of 1983 while an artist in residence at the 
Australian Commonwealth Scientific and Industrial Research Organisation’s 
(CSIRO) National Measurement Laboratory at Monash University under the 
Australia Council, Artists and New Technology Program. 6 He tuned his forks, 
which were both treble and bass, to a nineteen-note, just-intonation scale derived 
from Claudius Ptolemy’s Harmonics of the second century ce . 7 

As an experimental composer, Burt tinkers with múltiple versions of the same 
pitch. His work is a prime example of microtonal music — music that uses pitches 
smaller in interval than the twelve Western semitones per octave — which explores 
the sounds generated from nineteen to fifty-three pitches to the octave. For his CD 
TheAnimation ofLists and theArchytan Transpositions , Burt wished to expand this 
scale without making fifty-three tuning forks. His solution was to transpose recorded 
tuning forks in a Computer, creating an additional number of pitches that could 
not be generated by the tuning forks themselves. His preferred interval for the 
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transposition carne out to be a the 28/27 Archytas’ large 63 cent versión of the 
quarter-tone.” 8 Henee, with a Computer he was able to achieve what Scheibler, 
Koenig, and others had more than a century earlier with a tonometer. In essence, 
echoing the sentiments of numerous twentieth-century avant-garde composers, 
Burt wishes to challenge equal-temperament tuning, in which each semitone is the 
twelfth root of two times sharper than the preceding pitch. Since his work “Music 
for the Tuning Forks,” he has written numerous pieces using tuning forks, comput- 
ers, and even live-interactive Computer pieces. Interestingly, whereas Scheibler used 
his tonometer of tuning forks back in 1834 to assist piano tuners to tune in equal 
temperament, Burt uses tuning forks to explore creatively other possible tempera- 
ments. One music critic noted, “In Warren Burt’s hands, these tuning forks become 
some strange new instrument, complete with its own exotic tuning system.” 9 

Similarly, the siren also became a performance instrument of the twentieth 
century even though it had previously been solely a physical instrument, unlike the 
tuning fork and the metronome. Indeed, it is present in many more compositions 
than the tuning fork and the metronome. The avant-garde French composer Edgard 
Várese employed sirens in a number of his compositions, most notably in Ionisation , 
which debuted in 1931. Várese was fascinated by the intersection among Science, 
technology, and music. He studied Helmholtz’s Die Lehre von den Tonempfindungen 
while experimenting with sirens and whistles (Wen-Chung 1966, 165). In 1913 he 
had met René Bertrand, inventor of the Dynaphone, with whom he would go on to 
experiment on the possibilities of electronics as a musical médium. Three years 
after his initial meeting with Bertrand, Várese argued, “Our musical alphabet must 
be enriched. We also need new instruments badly . . . In my own works I have 
always felt the need of new médiums of expression . . . which can lend themselves 
to every expression of thought and can keep up with thought” (quoted in Wen- 
Chung 1966, 165; originally appeared in the New York Morning Telegraph , 1916). In 
1922 he suggested that “The composer and the electrician will have to labor together 
to get it” (quoted in Wen-Chung 1966, 163; originally appeared in the Christian 
Science Monitor, 1922). 

To develop an electronic machine for composing, in 1927 Várese collaborated 
with Harvey Fletcher, who was the acoustical research director at Bell Telephone 
Laboratories at the time. From 1932 to 1936 Várese applied annually for a Guggenheim 
Fellowship “to pursue work on an instrument for the producing of new sounds” 
(quoted in Wen-Chung 1966, 165). Sadly, the Guggenheim Foundation rejected 
his application each time. As a result, he conducted only a number of low-level 
experiments with phonograph turntables with motors set at different speeds. 

From 1929 to November 1931, while working on Ionisation , Várese had been 
contemplating the establishment of a music laboratory in which students would be 
able to compose music. Although the laboratory never carne to pass, one can nev- 
ertheless see that he was preoccupied with scientific instruments while composing 
Ionisation. He was particularly fascinated by the Science of sound and timbre and 
relentlessly sought out new musical instruments (Ouellette 1968, 103). Much like 
Beethoven and others of the early nineteenth century who used machines such as 
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the metronome to prevent the virtuosi from taking unrestrained liberties with their 
compositions, Várese proclaimed, with much bravado: 

The performer, the virtuoso, ought no longer to exist: he would be better 
replaced by a machine, and he will be. We shall find new intensities, too, for the 
realm of sound has still been very incompletely explored. These ideas still shock a 
great many people, but you will see them become realities in the more or less 
distant future . . . The composer will have improved, more flexible means at his 
disposal with which to express himself. His ideas will no longer be distorted by 
adaptation or performance as all those of the classics were. (quoted in Ouellette 
1968, 106) 

In 1930 Várese took part in a discussion on the future of the mechanization of 
music. His revolutionary views shocked many (Ouellette 1968, 104-105). Echoing 
the desires of Italian composer Luigi Russolo (Bijsterveld 2002, 121), Várese went on 
to express his hope that the innovative mechanical system would allow new tim- 
bres, intensities, and frequencies, furthering the work of acousticians on the limits 
of human hearing. 

Ionisation was composed for thirteen musicians playing thirty-seven percus- 
sion instruments, including two sirens, one possessing a high pitch, the other a low 
one, both of which were borro wed from the New York City Fire Department 10 
(Ouellette 1968, 105). The fire sirens, which Várese also used in subsequent compo- 
sitions, created a glissando effect through a myriad of frequencies without ever rest- 
ing on any one in particular, thereby using them in a manner quite different from 
the practices of nineteenth-century acousticians. 11 Rather than implementing a 
siren to provide a steady pitch as acousticians and physicists had in the nineteenth 
century, Várese stressed the instrumentas ability to elicit numerous tones with 
various pitches throughout an octave. Sirens in twentieth-century compositions 
were not meant to increase precisión but to abolish tonal increments in music. As 
Wen-Chung argües, Ionisation is a classic work “because it demonstrates that 
Varése’s concept is successfully applicable even when no definite pitches are pres- 
ent, the supreme test for his goal of liberating sound” (Weng-Chun 1966, 161, 163). 
It premiered at New York’s Carnegie Hall on March 6, 1933, under the direction of 
Nicolás Slonimsky, to whom the piece was subsequently dedicated. Varése’s 
concept was that of the “process of atomic charge as electrons are liberated and 
molecules are ionized.” 12 As the composer later divulged when reflecting on this 
work, “I was not influenced by composers as much as by natural objects and 
physical phenomena” 13 (Schuller 1965, 34). 

In the twentieth century the metronome also became a performing instrument 
to test the boundaries of the well-established classical musical qualities of rhythm. 
Avant-garde composers toyed with the metaphors of machines, docks, precisión, 
and standardization. The Hungarian Jewish composer Gyórgy Ligeti, born in 
Transylvania, Romanía, was a pioneer of electronic music who enjoyed challenging 
well-established musical boundaries. Wishing to remove himself from the classic- 
Romantic tradition, the young Ligeti embraced the electronic music of Karlheinz 
Stockhausen and Gottfried Michael Koenig “with regard not only to sonority but 
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also form, unfolding, the ‘flow’ of the music. It was a liberation for me — a libera- 
tion from this thinking in bars, this measured time” (quoted in Griffiths 1997, 22). 
He developed the musical technique of micropolyphony , a form of twentieth-cen- 
tury musical texture involving the use of sustained dissonant chords that shift slowly 
over time, or “a simultaneity of different fines, rhythms, and timbres.” 14 Ligeti him- 
self described micropolyphony as “[t]he complex polyphony of the individual parts 
[that] is embodied in a harmonic-musical flow, in which the harmonies do not 
change suddenly, but merge into one another; one clearly discernible interval com- 
bination is gradually blurred, and from this cloudiness it is possible to discern a new 
interval combination taking shape.” 15 

Much of his musical compositions dealt with time, its measurement, and its 
precisión. He spoke of c< the spatialization of the flow of time, creating a continuous 
present,” adding that “it would be much more worthwhile to try and achieve a 
compositional design of the process of change” (Griffiths 1997, 32-33). Perhaps the 
best example of his depiction of precisión and the spatialization of the flow of time 
was his Poéme symphonique for 100 Metronomes of November 1962. Ligeti himself 
concedes that the origin of this work dates back to a story with which he had been 
obsessed as a youth. A widow, whose husband has been dead for years, lives in a 
house full of docks constantly ticking away. The walls are bedecked with barome- 
ters and hygrometers. “The meccanico-type music really originates from reading 
that story as a five-year-old, on a hot summer afternoon” (quoted in Griffiths 1997, 
40). “Nobody comes, maybe for a hundred years. Nothing happens. So there is a 
combination of movement, which is machine-like, and absolutely nothing . . . 
a timelessness ... no beginning and no end” (quoted in Steinitz 2003, 8). 

Ligeti’s Poéme symphonique is seen as his final fluxus composition (the fluxus 
movement was an international network of artists, composers, and designers 
who were noted for blending different artistic media and disciplines in the 1960S 
and were active in neo-Dada noise music and art, literature, urban planning, 
architecture, and design). 16 The piece debuted at the annual Gaudeamus Music 
Week in the Hilversum City Hall in Holland in September 1963. Ligeti’s piece, which 
was an experiment in indeterminate rhythmic counterpoint, furthered his explora- 
tion of micropolyphonic textures. It offered the interest of individual clockwork 
mechanisms heard together and gradually thinning out. 

Ligeti himself proffered detailed instructions on how he wished to have his 
piece performed. 17 The metronomes are played by ten players following the con- 
ductor’s lead. Each player is responsible for operating ten metronomes, which are 
placed on resonators and brought onto the stage completely rundown. The metro- 
nomes are arranged in groups of ten around a microphone connected to a loud- 
speaker. The conductor gives the signal, ordering the players to wind up the 
metronomes on stage. The speeds of the pendulums are set: Within each group, 
each metronome beats out a unique tempo. Two versions of the performance are 
possible, and the conductor chooses one of the options before the performance. 
Either all the metronomes are wound equally tightly, or several metronomes in a 
group are wound unequally. The conductor also determines the interval of silence 
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(between two and six minutes) from the end of the preparation to the commence- 
ment of the performance. Another signal from the conductor instructs the per- 
formers to set all of their metronomes in motion simultaneously. a To carry out this 
action as quickly as possible, it is recommended that several fingers of each hand be 
used at the same time .” 18 With practice, a set of four to six metronomes can be set 
into motion simultaneously. Once the performance starts, the performers and the 
conductor immediately leave the stage, allowing the audience to observe. The 
conductor determines the duration of the pause at the end of the performance 
(i.e., from when the last metronome runs down completely to the moment when 
the conductor returns to the stage with the players in order to receive their 
applause ). 19 Sadly, such applause was never realized during the premiere in 
Hilversum, Holland. As Ligeti recalls many years later, 

Following the official speeches, the premiere proceeded according to plan. Since 
the audience had never heard of Fluxus, and since John Cage, too, was as yet 
completely unknown to the invited nobility and citizens of Fíilversum, the last 
tick of the last metronome was followed by an oppressive silence. Then there 
were menacing cries of protest, (quoted in Steinitz 2003, 128) 

The concert was scheduled to be televised later that week. “I was still Walter Maas’s 
guest in the curious Gaudeamus Foundation house, which was shaped like an open 
grand piano — we sat in front of the televisión awaiting the scheduled broadcast of 
the filmed event. But, instead, they showed a football game . . . the programme had 
been prohibited at the urgent request of the Hilversum Senate” (quoted in Steinitz 
2003, 128). 

The superimposition of pulsation grids produced by the metronomes (which 
Ligeti referred to as a moiré effect in physics, or an interference pattern created, for 
example, when two grids are overlaid at an angle or when they have slightly differ- 
ent mesh sizes) would be used later on in numerous of his compositions. At the 
beginning of the Poéme symphonique , the grids are so numerous that they coalesce, 
sounding disorderly and blurred. Ligeti added: 

As soon as some of the metronomes have run down, changing rhythmic patterns 
emerge, depending on the density of the ticking [i.e., the number of metronomes 
and their speeds]; until, at the end, there is only one, slowly ticking metronome left, 
whose rhythm is then regular. The homogeneous disorder of the beginning is called 
“maximal entropy” in the jargon of information theory (and in thermodynamics). 

The irregular grid structures gradually emerge, and the entropy is reduced since 
previously unpredictable ordered patterns grow out of the opening uniformity. 

When only a single metronome is left ticking in a completely predictable manner, 
then the entropy is maximal again — or so the theory goes. (quoted in Steinitz 
2003, 129) 

As years went on, Ligeti altered the performance of the piece. No longer insisting 
on the fluxus ceremony, he argued that “the metronomes should be started 
before the audience enters the concert hall, so that the piece truly runs like a 
machine: metronomes and audience are confronted with each other without any 
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human mediation” (quoted in Steinitz 2003, 129). The performers had completely 
vanished. 


Conclusión 


In the twentieth century, the metronome and the tuning fork were transformed 
from the invisible to the visible. What do I mean? Throughout the eighteenth and 
nineteenth centuries these physical instruments were seen as mechanical devices 
used to assist the musician and the composer by standardizing musical pieces. 
Recall the debate generated by the music director and composer in Gottfried 
Weber’s essay on the chronometer. The virtuoso was center stage. Clearly the met- 
ronome and the tuning fork were never to be used during a performance. In addi- 
tion, debates aróse as to whether mechanical devices deprived the art of music of its 
inherent expression of freedom. During the twentieth century, composers decided 
to draw upon these instruments to explore the various notions of precisión, whether 
it be pitch or beat. Whereas Scheibler’s tonometer was seen as a device to standard- 
ize and thwart the individuality of various temperaments, Burt’s tuning forks 
empowered him to explore forms of Creative expression using microtonalities. In a 
very real sense, avant-garde composers such as Ligeti and Burt provoked their lis- 
teners to rethink the very musical notions that metronomes and tuning forks had 
originally been created to reinforce centuries earlier, namely tonality and rhythm. 
Similarly, the sound generated by sirens in nineteenth-century acoustical work- 
shops and physics laboratories migrated to the concert hall throughout the United 
States and Europe. No longer committed to generating a constant pitch, they were 
now performance instruments used to increase the types of pitch available to imag- 
inative composers. They were now employed to thwart the precisión of pitch. 
Várese wished to stress the range of micro tonality that machines could produce, 
and in the process he rendered the virtuoso superfluous. The mechanical now did 
not merely threaten to quash the creativity of the musician interpreting the piece; 
in a number of cases it also became the performer. 


NOTES 


1 The efficacy of this test is rather controversial, however; http://www.fpnotebook. 
com/ENT/Exam/TngFrkTsts.htm (accessed Dec. 28, 2009). 

2 Franciscus Cornelius Donders and Hermann von Helmholtz used tuning forks in 
physiological and anatomical experiments. See Schmidgen 2005, 211-56) and Kursell (this 
volume). 

3 It was commonly used by French and British experimental natural philosophers 
during the 1820S, while Germán experimentalists began to use it in the mid-i830s (Jackson 
2006, 172, 323). 
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4 Although the importance of the siren as an acoustical instrument began to wane 
after the Ohm-Seebeck dispute, it was still the instrument of choice for the study of the 
combination of tones, the ratios of consonances, and interference (Robel 1891, 22; 
Pantalony 2005, 60; Helmholtz 1954, 162-63, 182). 

5 Www.Absoluteastronomy.Com/Topics/Tuning_Fork (accessed Dec. 28, 2009). 

6 Warren Burt, “Nonpop New Music Composer”; http://kalvos.org/burtwar.html 
(accessed Dec. 28, 2009). 

7 “Warren Burt: The Animation of Lists and the Archytan Transpositions”; http:// 
home.swipnet.se/sonoloco26/xi/burt.html (accessed Dec. 28, 2009). 

8 Http://Home.Swipnet.Se/Sonoloco26/Xi/Burt.Html (accessed Dec. 28, 2009). 

9 Http://Www.Forcedexposure.Com/Artists/Burt.Warren.Html (accessed Dec. 28, 
2009). 

10 Http://dic.academic.ru/dic.nsf/enwiki/668999 (accessed Dec. 28, 2009) and 
“Varése’s Ionization: The Percussion Revolution,” http://www.scena.org/lsm/sm5-10/ 
percu-en.html (accessed Dec. 28, 2009). 

11 Http://www.scena.org/lsm/sm5-io/percu-en.html (accessed Dec. 28, 2009). 

12 Http://Dic.Academic.Ru/Dic.Nsf/Enwiki/668999 (accessed Dec. 28, 2009) 
(originally cited in Slonimsky, 1994). 

13 Http://dic.academic.ru/dic.nsf/enwiki/668999 (accessed Dec. 28, 2009). 

14 David Cope, “Texture-Music” in New World Encyclopedia; http://www. 
newworldencyclopedia.org/entry/Texture_(music) (accessed Dec. 28, 2009). 

15 Http://www.newworldencyclopedia.org/entry/Texture_(music) (accessed Dec. 28, 
2009). 

16 Http://Fluxmuseum.Org/Fluxhibtion3-Intro.Html (accessed Dec. 28, 2009). 

17 Http://Www.Artnotart.Com/Fluxus/Gligeti-Poemesyrnphonique.FItml (accessed 
Dec. 28, 2009). 

18 Ibid. 

19 For a performance of Ligeti’s Poéme Symphonique, see http://www.youtube.com/ 
watch?v=X8v-uDhcDyg (accessed Dec. 28, 2009). 
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CHAPTER 9 


CONVERSIONS: 
SOUND AND SIGHT, 
MILITARY AND 
CIVILIAN 


CYRUS C. M. MODY 


Introduction 


Scholars in Science and technology studies (STS) have long noted scientists’ pre- 
dilection for converting data into visual representations. Indeed, examining visual 
representations yielded many early STS keywords. Laboratory ethnographers, for 
instance, famously reinterpreted scientists as producers of “inscriptions” (Latour 
1987). The conversión of Geiger counter “clicks” into “splodges” on a graph offered 
insights on inferential chains and interactions among subfields (Pinch 1985). 
Conversión to the visual provided a hook to mutually implícate visión and Science 
as hallmarks of modernity (Crary 1990). Visual representations became raw materi- 
als for art historical analyses of Science (Jones and Galison 1998) and histories of 
objectivity (Daston and Galison 2007). 

Visual representations, however, are neither inexorable ñor ubiquitous in 
Science. Scientists still work with their ears, fingertips, and taste buds. Some scientists 
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prefer certain data in audible rather than visual form (Helmreich 2007). Alexandra 
Supper (this volume) has described the nascent “sonification” community’s struggles 
for acceptance of this practice. Despite these struggles, sonification has found favor 
even among scientists whose work is primarily visual. I first ran across sonification, 
for instance, among scanning probe microscopists (Mody 2005), a group generally 
committed to visualization (henee micro scopy). Yet probe microscope technology 
lends itself equally well to visual and auditory outputs. The “microscope” is a small 
solid probe that scans and interaets with a surface; it assembles measurements of 
the probe-surface interaction into a data array that is usually rendered visually but 
sometimes in auditory or other forms. 

Sonification has been part of probe microscopy since its invention in the 1980S. 
The technique’s ability to image individual atoms, for instance, was discovered when 
its inventors heard — rather than saw — an unusual repetition in how a chart recorder 
printed data (Mody 2004, 107). Yet only a small minority of microscopists audibilize 
their data. Those who do offer justifications in phenomenological terms. Some con- 
sider sound better for perceiving change over time, while some combine visualization 
and sonification for a richer data environment. These reasons, however, are insepa- 
rable from sociological motivations. Sonification is usually practiced by high-end 
probe microscopists who build rather than buy their instruments. Sonification helps 
them present their work as distinctively fine-tuned and artisanal compared to the 
mass of probe microscopists. Prominent builders such as Jim Gimzewski use sonifi- 
cation to forge collaborations with artists (Roosth 2009), and other builders play 
their microscopes’ auditory output as background music during public talks. Those 
who buy their microscopes are much less likely to use sonification this way. 


Two Senses of “Conversión” 


Let us group visualization and sonification under a wider category of “synesthetic 
conversión” from one sense to another. In probe microscopy, the same people who 
practice sonification often venture into other senses as well. For instance, when the 
inventors of the technique heard their chart recorder and realized they could image 
individual atoms, they turned the chart recorder strips into a three-dimensional, 
tactile sculpture of the atoms rather than a (more ordinary) two-dimensional 
micrograph. Other prominent microscope builders who audibilize data also some- 
times use haptic feedback, where the microscope physically pushes against its oper- 
ator (e.g., when the microscope nudges carbón nanotubes or pries proteins apart). 
As with sonification, tactile or haptic representations are associated much more 
with microscope builders and tinkerers than with those who buy a microscope for 
use with few modifications. 

That is, those probe microscopists who view synesthetic conversión as an 
embodied strategy for perceiving data in nuanced ways also deploy synesthetic 
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conversión in the politics of their research community. Sonification and haptic 
feedback distinguish developers of new microscope technology from the mas s 
of microscope users. Synesthetic conversión provides unusual interdisciplinary 
bridges to fields such as art, Computer Science, and robotics, and it locates skill and 
authority in probe microscopy by showing whose data and technology are so subtle 
and sophisticated that múltiple senses (rather than visión alone) are needed to 
understand or opérate them. 

In today’s probe microscopy, synesthetic conversión is generally limited to the 
small-p politics of jockeying within a field. But synesthesia can be embroiled in 
big-p politics. To demónstrate how, this chapter traces the prehistory of probe 
microscopy back to Vietnam-era protest at Stanford University. At that time, Calvin 
Quate (the coinventor of the atomic forcé microscope [AFM], the most common 
kind of probe microscope) invented a closely related instrument, the scanning 
acoustic microscope. Acoustic microscopy and other synesthetic conversión tech- 
nologies were at the forefront of Stanford researchers’ response to calis for more 
socially responsible Science. Synesthetic conversions — from sound to sight, sight to 
touch, sight to sound, touch to sound, and so on — were embroiled in the “recon- 
versión” of American academic research from military funding and applications to 
civilian funding and an orientation to “human problems.” 

As today’s probe microscopy shows, synesthetic conversión in Science does not 
have to be freighted with ambitions for societal reform. Moreover, “reconversión” 
at Stanford and elsewhere was never limited to new synesthetic conversión tech- 
nologies. However, the acoustic microscopy case shows that, at times, projects for 
reforming knowledge making are coproduced with scientists’ conversions of 
phenomena into visual, tactile, haptic, or auditory form. 


Boiling Campus, Anxious Engineers 


In the late 1960S, “reconversión” was the widely used term for the turbulent, occa- 
sionally violent, debate about whether American academic scientists needed to 
forego defense research and take up civilian social problems. Often historians’ sto- 
ries about this period end with reconversión as the dissolution of the early Coid 
War “military-industrial-academic” arrangement. The last chapter of Bill Leslie’s 
(1993) The Coid War and American Science , for instance, is “The Days of Reckoning: 
March 4 and April 3,” the dates in 1969 when the reconversión debate erupted at, 
respectively, MIT and Stanford. 

One need only visit the Stanford University Archives — the primary data 
source both for this chapter and for the Stanford portion of Leslie’s book — to see 
how deeply reconversión affected that campus in the late 1960S and early 1970S. 
University administrators’ memos, campus newspapers, and faculty members’ pri- 
vate correspondence (including Quate’s) all speak to the frantic need to make up 
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for declining defense research funding and to either quash or appease campus 
protestors’ demands for more civilian research. Yet Leslie demonstrates that 
most of the reconversión movement’s long-term effects were cosmetic. Both 
MIT and Stanford ended classified research and divested themselves of some 
defense-oriented institutes, but most of the proposed reforms were abandoned. 
Postdivestment, centers such as the Stanford Research Institute and Draper 
Laboratory prospered and maintained closer ties to Stanford and MIT than 
reconversión proponents hoped. 

Reconversiones consequences were not all short-lived, though. Matthew 
Wisnioski (2005) sees reconversión as part of a wider movement in which American 
scientists and engineers questioned their motivations and practices, founded STS 
programs, read works by the Frankfurt School, and collaborated with artists. Eric 
Vettel (2006) sees reconversión as triggering the emergence of the biotech industry. 
Both Leslie (2010) and Wisnioski show that, at MIT, some researchers did success- 
fully move away firom military funding and applications — though those who 
succeeded were skeptical their actions could be widely replicated. 

Nevertheless, reconversión proponents heralded such success stories. As 
Holt Ashley, an aeronautical engineering professor in the Stanford School of 
Engineering, put it in a short-lived student-faculty publication dedicated to debate 
about reconversión, “Against the unregenerateness and glacial metamorphosis 
of some, I donY hesitate to place the valuable progress on socially significant 
projects led by Lusignan, Meindl, Anliker, Homsy, Pantell, McCarty, and DeVoto.” 1 
Of these, James Meindl offers the clearest instance of the entanglement of reconver- 
sión and synesthetic conversión and an explicit model for Quate’s foray into acoustic 
microscopy. 

An electrical engineer working for the Army Signal Corps, Meindl moved 
to Stanford in 1967 and began working with John Linvill, chair of the electrical 
engineering department (Lécuyer 2005). Since 1962, Linvill had been developing 
the Optacon, a device for scanning printed pages and converting text into mechan- 
ical vibrations that could be felt by a blind reader. LinvilFs original work predated 
the lean budgets and political turmoil of the late ’6os and was based partly on per- 
sonal motivations (his daughter was blind). MeindLs arrival, though, coincided 
with the first reconversión murmurs at Stanford and triggered an acceleration 
of Optacon research and its extensión into other civilian and socially conscious 
applications. 

As a 1973 brochure put it, MeindLs lab’s “central objective . . . is to prepare the 
student to use integrated circuit technology in an innovative manner in solving the 
problems of our society . . . particularly in the field of medical electronics.” 2 
“Problems of our society” was typical reconversión rhetoric. Synesthetic conver- 
sión was a recurring motif of the approach MeindLs lab took to solving those “prob- 
lems of our society”: conversión from sight-to-touch (the Optacon, or “OPtical to 
TActile CONverter”), sound-to-sight (ultrasonic soft-tissue imaging), and later 
sight-to-sound (an adaptation of the Optacon to read scanned letters aloud via 
synthesized voice) and touch-to-sound (an artificial ear project). 
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That synesthetic conversión was upheld by Stanford administrators as a model 
for reconversión can be seen in the annual reviews of Stanford’s electronics research. 
These reviews summarized the results of funding from the military Services and 
NASA. Thus, through 1968, defense applications headlined, with civilian applica- 
tions tucked at the back. In the 1968 review, for instance, the Optacon was described 
briefly (in purely technical terms) on page 27. The 1969 review, however, carne just 
four months after a student takeover of Stanford’s Applied Electronics Lab and the 
university administration’s decisión to ban classified work on campus. Suddenly, 
the Optacon was promoted to page 1, where the report pointed out the following: 

Two nontechnical aspects of the [Optacon] reading-aid project are noteworthy. 

(1) Integrated circuits, principally developed to the present stage for space and 
military applications, are powerful tools for the solution of human problems, as 
this research project illustrates. (2) Such projects . . . [are suited to] channeling 
[the] interest of the imaginative gradúate student to important social problems . 3 

The rhetoric here contains clear gestures to reconversión: “Space and military 
applications” are specifically contrasted with “human problems,” and gradúate 
students are imagined as above all interested in solving “social problems.” 


From Acoustic Wave Devices to 
Acoustic Microscopy 


Linvill and MeindEs Optacon and other synesthetic conversión technologies were a 
bright light in a dark year for the Stanford administraron. The Optacon offered a 
riposte to reconversión proponents and a guide to other faculty members, espe- 
cially those needing new funding after Stanford lost $2 million when it canceled its 
classified research contracts. It is perhaps unsurprising that Calvin Quate was one 
of those who followed their example. His research was heavily funded by the mili- 
tary and therefore jeopardized both by the new Stanford policy and by cuts in fed- 
eral defense research funding. Moreover, Quate was an ardent admirer of Meindl, 
and Linvill was his department chair. 

Quate received his doctórate from Stanford in 1950, working on microwave 
traveling tubes. He continued that research for almost a decade at Bell Laboratories 
before taking a joint appointment in electrical engineering and applied physics at 
Stanford in 1961. In the ’6os, he studied ultrasonic-electromagnetic interactions in 
crystals, particularly for signáis processing. He became well known for research on 
acoustic wave devices, by which an electrical signal is converted into an ultrasonic 
wave, processed via interaction with a crystalline matrix or tiny interdigitated struc- 
tures, and then reconverted into an electrical signal. The U.S. Air Forcé and U.S. 
Navy funded Quate to develop acoustic wave devices to pick radar signáis out of 
background noise and recognize radar patterns corresponding to specific aircraft. 
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The idea that Quate’s electroacoustic research could be adapted to microscopy 
was suggested by Rudolf Kompfner — another Bell Labs veteran — while they “were 
sitting around the swimming pool one day” in 1966 (National Science Foundation 
1978, 36; Pierce 1983). Kompfner proposed that a thin film transducer could create 
an ultrasonic wave that could be focused with a lens onto a sample (figure 9.1). After 
interacting with the sample, the transmitted ultrasonic radiation would then hit 
another transducer (or reflected ultrasonic radiation would hit the original trans- 
ducer) and be converted into an image. 

The way Quate and Kompfner spoke of their turn to acoustic microscopy ges- 
tured toward an intertwining of synesthetic conversión and reconversión. For a few 
years after their poolside chat, they did little work in the area. As reconversión 
boiled over in 1969, though, both men put aside other commitments and focused 
on acoustic microscopy in a way that, consciously or not, aligned with the aspira- 
tions of reconversión proponents. In their justifications for this work, Quate and 
Kompfner made the case that acoustic microscopy would have enormous civilian 
benefits. They did so partly by investing the visual apprehension of invisible 
phenomena with nearly magical importance in world history: 

If one makes the crude assumption that the acquisition of new knowledge 
is in some way proportional to the extensión of the scale of magnitude of the 



Figure 9.1 Schematic of (reflection mode) scanning acoustic microscope. Reproduced 
with permission of University of California Press, from Edwin S. Boatman, Michael W. 
Berns, Robert J. Walter, and John S. Foster, “Today’s Microscopy: Recent Developments 
in Light and Acoustic Microscopy for Biologists,” Bioscience 37 (June 1987): 384-94. 
Permission conveyed through Copyright Clearance Center. 




230 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


perceived objects . . . one might even be tempted to compare the total amount of 
information about nature at any time with the . . . useful magnification of the 
best microscopes. (Kompfner 1975, 619) 

It is hard to conceive of a modern laboratory involved with technology or the 
advancement of Science that is without a microscope of some kind. It has been 
suggested that the number of instruments in a given country used to extend our 
“visión” beyond the limits of the unaided eye can be used as an indicator of the 
progress and advancement of that región. 4 

That is, for Quate and Kompfner, a society capable of making progress was neces- 
sarily one where human “visión” was extended “beyond the limits of the unaided 
eye” both in scale and in the types of radiation that could be perceived. Synesthetic 
conversión was therefore inseparable from societal “advancement.” 

For the first few years, the Office of Naval Research funded Quate’s acoustic 
microscope work as part of an umbrella package to Stanford rather than an indi- 
vidual grant. 5 Before long, though, Quate disentangled acoustic microscopy from 
defense funding. I have no direct evidence that this was in response to reconversión 
pressure. There is no reason to think Quate ever had any personal qualms about 
defense funding for his research. Indeed, he was not cooperative with reconversion- 
minded students’ investigations of defense-funded research at Stanford (SWOPSI, 
1971, 215). It is probable that, like Meindl, Quate enthusiastically embraced “human 
problems” research while also viewing defense-funded research as both good 
Science and good citizenship. 

However, Quate’s turn to acoustic microscopy eerily matched specific Stanford 
administration recommendations in the wake of the first protests against on- 
campus classified research in 1966. In particular, a memo from Linvill in 1967 
detailed the electrical engineering department’s steps toward reform: 

Stanford University can and should become more effective in studying and 
attacking the problems of today’s society. Electrical Engineering, with its aim to 
bring technological tools to the solution of man’s problems, is interested to join 
with other departments in working on these contemporary problems. . . . The 
usefulness of university engineering and Science to our defense efforts since the 
beginning of World War II is clear and well documented. It is also clear that 
attention should be directed in a university to other problems. . . . [E]nvironmental 
studies, urban problems, problems of developing countries, etc. — are timely 
problems to which the university in many of its parts should be directed. These 
problems cannot be attacked within a single discipline. . . . Financial support for 
the research must be found outside the university, however, as has been the case 
in our DOD and space research. . . . Representative areas providing opportunities 
are: (a) education by teaching machines, (b) automation and cybernetics, 

(c) applications of solid-state electronics to the life Sciences, and (d) application 
of computers in life and economic systems. 6 

Note that “defense efforts” are specifically contrasted with work on “the problems 
of today’s society” and that the memo clearly encourages Stanford engineers to 
inelude more of the latter in their research portfolios. 
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Linvili/s Memo 


The acoustic microscope aligned with LinvilLs memo in a variety of ways. 
Most important, it answered LinvilLs cali for “applications of solid-state electron- 
ics to the life Sciences.” At the time, Quate and Kompfner spoke of their microscope 
as having application only in biomedicine and not in areas of interest to the 
military. a [T]he original intention behind the Stanford work was to make an 
instrument which would achieve a resolution on the order of 0.1 pm for the pur- 
pose of studying biological objects, particularly the nuclei of cells” (Kompfner 
1975, 627). 

This initial exclusive emphasis on biological specimens is curious. The first sug- 
gestion for an acoustic microscope in the West was a translation (Devey 1953) of a 
Soviet article by a program manager at the Office of Naval Research (ONR). That 
árdele mentioned only metal and glass samples, not biological ones. Yet when 
Quate and Kompfner approached the problem, they were initially oblivious to non- 
biological samples. They were acutely aware that the initial funder for the acoustic 
microscope, the military’s Joint Services Electronics Program, might object and ask 
whether Quate was “going to come up with an instrument for studying materials or 
an instrument for studying biological structures? And why is any of this related to a 
research program for electronics?” 7 Yet despite that potential objection (and their 
awareness of the 1953 ONR article), Quate and Kompfner initially downplayed 
acoustic microscopy’ s application to military- relevant materials, while highlighting 
its relevance to medicine. 

Acoustic microscopy also aligned with LinvilLs reconversión memo in that it 
enabled Quate to search for new “financial support . . . outside the university.” 
Unlike his earlier defense-funded microwave work, Quate’s acoustic microscopy 
research was funded by the John A. Hartford Foundation, National Science 
Foundation, National Institutes of Health, National Bureau of Standards, and IBM. 
Quate also licensed acoustic microscope patents to companies selling to the civilian 
market — something he had not attempted before. Those licenses were negotiated 
through Stanford’s new Office of Technology Licensing (OTL — one of the first of 
its kind; see Smith-Hughes 2001; Colyvas 2007; Yi 2008, 182-3). The rhetoric of 
reconversión suffused justifications for the OTL. As a Stanford vice president put it, 
“I am pleased by the high societal valué of inventions we have licensed [such as] a 
potential cure for viral infections and a potential ecologically safe insect control” 
(quoted in Colyvas 2007, 69-70). 

As the quote indicates, the OTL concentrated on fields favored by reconversión 
proponents, especially biomedical and environmental applications. The OTL’s ini- 
tial contacts with Stanford engineers, therefore, emphasized inventions with 
humanistic and civilian application, especially ones that used synesthetic conver- 
sión to forge interdisciplinary links. John Chowning’s patents for electronic sound 
synthesis and proprioceptive perception of sound, for instance, linked engineering 
to music (Nelson 2005). Similarly, MeindLs patents for ultrasonic blood imaging 




232 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


tied engineering to biomedicine. Meindl and LinvilFs Optacon was also commer- 
cialized in the early ’yos (Lécuyer 2005, 60). 

The OTL viewed acoustic microscopy as yet another bridge between engineer- 
ing and life Science. In 1974, the office licensed American Optical to use Quate’s 
patents to manufacture commercial microscopes. By that point, Quate had realized 
acoustic microscopy had applications in microelectronics manufacturing in 
addition to biomedicine. However, for the OTL, microelectronics had neither 
the “high societal valué” ñor the interdisciplinary cachet of life Science. For that 
reason, OTL saw microelectronics as secondary to (indeed, a subset of) biomedical 
applications: 

A license agreement for commercial development of a microscope that uses 
ultrahigh-frequency sound waves to see into living cells and other materials has 
been signed. . . . By comparing [acoustic and optical microscopy], diagnosticians 
should be able to tell more about cell functions and disorders. Since ultrasonic 
waves can penétrate materials, they also can be used to show defects in micro- 
scopic integrated circuits. The new integrated circuit technology already has 
provided the world with life-saving heart-pacers, hearing aids, minicomputers, 
and many other benefits . 8 

For the OTL, electrical engineering was worth mentioning not for itself but for its 
contribution to biomedicine. 

The OTL’s attitude was common at Stanford at the time. As LinvilFs 1967 memo 
and The Grindstones praise for MeindFs biomedical collaborations show, engineers 
working on biomedical topics were seen as “attacking the problems of today’s soci- 
ety” in a way that those working on radar-signals processing were not. LinvilFs 
declaration that biomedical, environmental, urban, and other civilian “problems 
cannot be attacked within a single discipline” was intended to make engineering 
expertise relevant to such problems and to stimulate Stanford’s engineering faculty 
to pursue interdisciplinary collaborations that would bring diversified funding and 
a positive reception from reconversión proponents. 

As Bill Leslie (1993) notes, the military had promoted interdisciplinarity 
among electrical engineers, physicists, and applied physicists at Stanford since 
the 1950S. Reconversión, however, brought calis for much wider-ranging collabora- 
tions between engineering and music, philosophy, economics, biology, and so on. 
Reconversión proponents believed narrowly discipline-based questions distracted 
engineers from solving “human problems” and shielded them from the moral con- 
sequences of their work. Working with other disciplines would allow engineers to 
become aware of a wider range of society’s problems to which they could apply 
their knowledge. 

Along those lines, LinvilFs 1967 memo proposed to: 

introduce a Sénior or Master’s level program involving both engineers 
and students from the Humanities and Sciences for the purpose of 
discussion and formulation of problems of significance which society 
needs to have solved and to which technology can contribute as a partner 
with the humanities . 9 
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The STS programs formed in this period — including Stanford’s — also reflected this 
view of the healing power of interdisciplinarity. More generally, the number of 
degree-granting interdisciplinary programs at Stanford doubled from 1968 to 1969 
(the largest relative or absolute increase in the school’s history) and grew seven 
times more rapidly in the ’zos than in the ’6os (Nelson 2005). 

Ñor was Stanford alone in connecting interdisciplinarity to civilian reconver- 
sión. For instance, when the National Science Foundation was commanded in 1971 
to fund research “to solve major problems such as pollution, transportaron, 
energy and other urban, social and environmental problems and to initiate and 
expand applied research essential to technological advancement and economic 
productivity,” 10 it did so not through its traditional disciplinary units but through 
a program called Interdisciplinary Research Relevant to Problems of Our Society 
(renamed RANN — Research Applied to National Needs). In a time of declining 
federal research funding in general and defense research funding in particular, such 
interdisciplinary funding streams provided a lifeline. Quate, for one, sought fund- 
ing from RANN for acoustic microscopy, as well as for power transmission line 
research. 


From the Life Sciences to 
Microelectronics 


So acoustic microscopy aligned Quate with reconversión aspirations by facilitating 
connections to life Science, to new funders, and to the market (and henee to wider 
society). Those connections consisted in the microscope’s ability to combine sound 
and visión and allow life scientists to see “the density, the elasticity and the viscosity 
[of cells] — properties which are far more vital to the functions of living tissue than 
the optical refractive Índex.” 11 Getting nonengineers to “see” acoustically, though, 
was harder than Quate anticipated. 

In the first place, he had difficulty weaning himself from defense funding partly 
because civilian agencies needed visual proof of synesthetic conversión before they 
would fund him. “ £ We had a concept, [but] we had no lens, we had no photo- 
graphic film, we had no way of displaying the image,’ Quate recalls. He applied for 
funding from the federal government, but was unsuccessful ‘because there were no 
images, there were no results, there was just a theory’ ” (Jacobson 1984, 132). 

Luckily for Quate, a Stanford Medical School colleague told him about the 
Hartford Foundation, a philanthropy that funded biomedical research. Hartford 
gave Quate $170,000 for two years starting in 1969, after which he had a prototype. 
From that first grant, Quate learned that the microscope would need to sean the 
lens relative to the sample (figure 9.1). Without scanning, the acoustic image 
would have to be received by an array of detectors, each kept in phase with the 
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rest — a nearly insurmountable task at the time. With scanning, a single detector 
could capture a tiny picture of the sample at any moment; by moving over the 
sample, in time a complete two-dimensional image could be generated on an 
oscilloscope. 

Quate then secured $300,000 from Hartford for the next three years by promis- 
ing to collaborate with biomedical researchers. Thus, he began soliciting colleagues 
for biological samples to be imaged with the acoustic microscope. Eventually 
he could point to samples received from the Stanford departments of surgery, 
pathology, psychology, radiation oncology, hematology, anatomy, medical micro- 
biology, and biology, as well as from the Mayo Clinic, University of California at 
Irvine, Albert Einstein School of Medicine, University of North Dakota, National 
Cáncer Institute, U.S. Department of Agriculture, and the Faculty of Medicine in 
Montpellier, France. 

Quate’s pitch to collaborators and to funding agencies pivoted on comparing 
the resolving power of acoustic microscopy with more familiar optical and electrón 
microscopes. The basic milestone Quate set himself was a comparative one: 

One of the earliest goals that we set for ourselves in the acoustic microscope 
program was to record an image using an acoustic wavelength smaller than 
that used in the optical microscopes. . . . Now it has happened — we have an 
instrument operating with an acoustic wavelength in water equal to 5450 Á! ! 12 

Later Quate described the event with even more excitement: “Never again will we 
have to State that the resolution is within c a factor of two of the optical resolution.’ 
Never again will we have to answer the query c what is it good for?’ ” 13 

As I’ve written elsewhere (Mody 2000), proponents of a new technology use 
comparisons to other artifacts at their peril. Comparisons can create a logic that 
makes it difficult to exploit the new technology’s unique abilities. In the case of 
acoustic microscopy, the technique’s resolution matched the optical microscope’s 
only after a decade of work, during which biomedical researchers provided Quate 
with samples but saw few reasons to adopt the new instrument. 

Even after Quate achieved parity with optical microscopy, life scientists 
complained that (on the one hand) acoustic microscopy still had poorer resolution 
than electrón microscopy, while (on the other hand) it was still less user friendly 
than optical microscopy. American Optical, in declining to manufacture commer- 
cial microscopes for life scientists, noted that “the key problem in encouraging 
broader use of the acoustic microscope was that it took on the order of one day 
from receipt of a specimen until a useable picture was obtained.” 14 That is, one 
reason life scientists avoided the technique was that the synesthesia of acoustic 
microscopy was not effortless enough. Biomedical researchers could simply walk 
up to an optical microscope and peer through it to get the answers they wanted, 
whereas acoustic microscopy required patience. As Quate (1985, 134) put it, “We 
can compete in resolution, but we cannot compete with the immediacy of the 
optical images. When you look, the optical image is there. For an acoustic image of 
similar quality you must wait.” 
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Biomedical researchers preferred the immediacy of visión to acoustic micros- 
copy’s mediation of sight and sound. To overeóme the invidious comparison, 
Quate pointed out that acoustic microscopy could “see” things optical microscopes 
couldn’t: 

If the observer had a choice it is not obvious that he would choose optical waves 
for the microscopic examination. . . . [F]or the most part the form, function, 
and growth patterns of cellular complexes do not depend on their optical 
properties. . . . Contrary to this the elastic properties are directly and intimately 
connected to the form and function of a cell. . . . In the new microscope acoustic 
waves in the form of propagating elastic waves — familiar as water waves in the 
oceans and sound waves in the air — are used in such a way that we can now 
monitor these elastic properties. 15 

Biomedical researchers were neither synesthetically ñor cognitively familiar enough 
with ultrasonic radiation to be comfortable with acoustic microscopy. They knew 
what an optical micrograph of a cell meant, but images made with acoustic radiation 
required a difficult translation between senses: 

The training and background of such people as pathologists and histologists 
cannot be easily transferred in a way that will allow them to interpret the 
acoustic micrographs with the same facility that is now done with the optical 
photos. . . . [WJork to date has been done in an electronic laboratory with people 
possessing a background in physics and electronics. No one in biology or in 
medical research has carried out a significant piece of research with this new 
instrument. 16 

Difficulties in interpreting acoustic micrographs stymied Quate’s efforts to deepen 
collaboration beyond biomedical researchers’ simply handing him samples. 

Without those deeper collaborations, agencies balked at funding biomedical 
acoustic microscopy research. As an NIH reviewer told Quate in turning him down, 
“[T]his research will require a more definitive commitment and interaction with 
cell biologists, preferably one who can make a daily involvement.” 17 By 1974, there- 
fore, Quate was seeking alternatives to biomedical applications and the civilian 
agencies that funded them. This shift was aided by two changes in the political envi- 
ronment of American Science. First, reconversión fervor had burned itself out. 
American universities were no longer overwhelmed by protests, sit-ins, teach-ins, 
or faculty strikes. Soul-searching publications like The Grindstone had petered out. 
While many Americans still wanted scientists to tackle society’s problems in prefer- 
ence to the military’s, few now vocally questioned the morality of researchers who 
took defense funding. 

Second, the American public’s laundry list of civilian problems that scientists 
should tackle began to converge with research problems relevant to the military. In 
the late 1960S reconversión proponents had pushed for research on environmental 
problems, urban housing, mass transit, and medical research. These were areas 
where applied physicists/electrical engineers like Quate were most out of their 
depth, as shown by his difficulties forging biomedical collaborations. After the oil 
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crisis of 1973, those problem areas faded and were replaced by alternative energy 
research. The NSF’s RANN budget, for instance, doubled from 1974 to 1975, due 
almost entirely to new funding for solar and geothermal energy research (WSF 
1974). In these areas — particularly solar — physical scientists and electrical engineers 
could contribute without having to expand their horizons as broadly. 

As the economy further deteriorated, a new problem area aróse — national 
economic competitiveness, particularly with Japan in microelectronics. In the 
dynamic random access memory (DRAM) market — the microelectronics indus- 
try’s barometer — the ratio of U.S. firms 5 share to that of Japanese firms went from 
19:1 in 1971 to less than 5:1 in 1974 to 1.4:1 in 1977 (Macher, Mowery, and Hodges 
1998). American policymakers, microelectronics manufacturers, and academic 
electrical engineers monitored this gap closing with alarm. The Japanese govern- 
ment’s announcement in 1973 of a VLSI (very large-scale integrated circuits) crash 
program prompted calis for the U.S. government to aid domestic manufacturers by 
funding academic microelectronics research. 

Here, applied physicists/electrical engineers like Quate could serve national 
needs without the complications of Crossing many disciplinary boundaries. 
Moreover, in microelectronics the goals of military and civilian funding agencies 
coincided. The military wanted to stimulate domestic manufacturers to avoid 
dependence on offshore producers for advanced chips. Also, microelectronics 
research was perceived as more likely to have dual military-civilian applications 
than the biomedical and environmental research earlier favored by reconversión 
supporters. Thus, Quate could now propose to the armed Services to use acoustic 
microscopy to inspect both microelectronic circuits and materials used in advanced 
airframes and naval vessels. 18 Perhaps the clearest sign that civilian and military 
interests aligned around microelectronics was a program that funded Quate from 
1976 to 1978 on “Innovative Measurement Technology for the Semiconductor 
Industry” — run jointly by the National Bureau of Standards (part of the Department 
of Commerce) and the Advanced Research Projects Agency (the Pentágonas 
long-range research- funding arm). 

Through that program, Quate had much more success finding collaborators 
than he had with biomedical applications. Samples of microelectronic devices for 
the acoustic microscope to inspect carne to him from Hewlett-Packard, Avantek, 
IBM, Fairchild Semiconductor, and other companies. Many of the microelectron- 
ics researchers who sent samples found acoustic microscope images useful — much 
more so than biomedical researchers had. They were particularly pleased that the 
acoustic microscope could look through a thin film to see whether it completely 
adhered to its substrate, something other instruments could not do. Since applying 
thin films is an essential step in microelectronics manufacture, catching defective 
films could save firms millions. 

For that reason, electronics firms — unlike biomedical researchers — built their 
own acoustic microscopes or bought them when they became commercially avail- 
able: Bell Labs, Hughes Aircraft, IBM, Hitachi, DuPont, Westinghouse, Motorola, 
TRW, Intel, and others are all mentioned in this regard in Quate’s correspondence. 
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This interest in turn meant that microscope manufacturéis could foresee a market 
in a way that American Optical had not when the only customers were likely to 
come from biomedicine. With Quate’s help, two firms, Leitz in West Germany 
and Olympus in Japan, developed commercial instruments for microelectronics 
customers. 


SyNESTHESIA, CONVERSION, AND 

Awareness 


With the return to electronics applications and defense funding, both reconversión 
and synesthetic conversión faded as drivers of Quate’s research. To understand 
why, we need to delve into the reason these phenomena were paired in the first 
place. Here I will be extremely speculative. For now, the documentary record is too 
sparse to directly address this point. We can, however, approach the issue by asking, 
what was reconversión really about? 

I would argüe that reconversión aimed to bring researchers into a heightened 
State of awareness — of themselves and others and of connections between the two. 
Most narrowly, this meant making researchers aware of the consequences of their 
research for the people of Southeast Asia. It was on that narrow interpretation of 
reconversión that the debate became heated and occasionally violent. Nonetheless, 
many reconversión supporters sought a broader awareness. They wanted research- 
ers not simply to be aware of society’s problems (which many already were) but also 
to see themselves as agents that possessed the capacity to solve those problems. 

Some reconversión proponents focused on institutional and political changes 
to facilítate this awareness. Lessening researchers’ dependence on military funding, 
for instance, would make them more aware of civilian problems and of the agencies 
that fund scientists to solve those problems. Likewise, the creation of interdisciplin- 
ary centers would make researchers like Quate aware of “human problems” in the 
life Sciences in such a way that they could envision how electrical engineering could 
solve those problems. Reconversion-oriented publications like The Grindstone 
pushed faculty to recognize their own political agency in effecting reconversión: 
“If the faculty truly desires to redirect the School, they must accept the inconve- 
nience of lobbying the executive to change budget priorities, and then follow 
through by providing advice and testimony, if necessary, for the Congress .” 19 

Yet reconversión was not limited to institutional and political reforms. The 
counterculture encouraged many scientists and engineers to follow a technological 
and synesthetic path to awareness as well. For them, awareness could be expanded 
by developing new technologies for integrating the human mind with a greater 
range of perception and/or for making the perceivers more aware of their place 
within society and the environment. 
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One epicenter for this approach was Quate, Linvill, and Kompfner’s former 
employer, Bell Labs. Matthew Wisnioski has detailed efforts in the late ’6os to carve 
institutional space at Bell Labs for collaborations with artists and other ways to 
humanize engineering practice. One result was the Experiments in Art and 
Technology (E.A.T.) organization that brought Bell Labs engineers together with 
artists such as Robert Rauschenberg. As Wisnioski (2005, 300-301) describes it, 
most ofE.A.T.’s few successful engineer-artist collaborations involved synesthetic 
extensions of their audience’s awareness, such as a a glass-enclosed cube with a high- 
intensity light beam, which illuminated a pile of dust that responded to acoustic 
vibrations of recorded heart rhythms” or a sculpture “sensitive to changes in its 
environment, [which] consisted of vibrating Steel rods illuminated by strobe lights, 
which modulated to the sounds in the room.” 

Kompfner worked with one of E.A.T. ’s organizers, Billy Klüver, and provided 
advice for John Cage’s contribution to E.A.T.’s “9 Evenings: Theatre and 
Engineering” shows (Klüver 1988), in which performers’ movements tripped pho- 
tocells, triggering sound sources all over New York — a true synesthetic symphony. 
Around the same time, he wrote an unpublished essay expressing his own, complex 
views on music, technology, and cognition: 

Music represents the human mind. It is the representation by the human mind of 
itself. . . . In depicting the processes, the States and the modes of operation of the 
human mind music at first did it simply and in a primitive way, limited perhaps 
by the technology which was available for its expression. As western technology 
developed and surpassed anything that existed before on this planet, it became 
possible to express music in ever increasing complexity, and to depict ever 
increasingly complex mental phenomena, so that finally music has achieved the 
power to represent all of the human mind, and perhaps even more than that. 

Music describes what human mind has not yet reached, and may perhaps never 
reach; it goes beyond the compass of the human mind just as mathematics does 
when it operates in many dimensions . 20 

Quite possibly Quate knew about E.A.T. through Kompfner. If not, he certainly 
knew about Bell Labs’ collaborations in electronic and computer-aided music 
with Stanford’s John Chowning since he played a role in Chowning’ s tenure case. 
Quate and Kompfner’s oíd boss, John Pierce (to whom Kompfner sent his essay on 
music), connected Chowning with Max Mathews, another Bell Labs musician- 
engineer. Pierce also encouraged Chowning to patent his FM synthesis work 
(Reiffenstein 2006). Later Quate was instrumental in bringing Pierce and Mathews 
to faculty positions in Chowning’s Center for Computer Research in Music and 
Acoustics. 21 Though there is no evidence Quate himself participated in synesthetic 
artist-engineer collaborations, his closest colleagues did so while he worked on 
acoustic microscopy. 

The other epicenter of technophilic-synesthetic extensions of awareness was 
the San Francisco Bay Area. At Berkeley, for instance, physicist Don Buchla was 
building particle accelerator components for the Atomic Energy Commission when 
the free speech movement pushed him toward more humanistic, civilian, and 
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synesthetic applications of his talent for electronics (Pinch and Troceo 2002, 33-36). 
Initially that meant developing a transistorized hearing aid, then a proprioceptive 
sensor for blind people that “changed pitch according to its proximity to objeets.” 
Most famously, he developed the Buchla Box, an analog synthesizer distinguished 
from its rivals by Buchla’s interest <c in involving as many senses as possible” (Pinch 
and Troceo 2002, 49). 

Similarly, Buchla’s patrón, Ramón Sender Barayón, coorganized the Trips Festival 
of 1966 with (Stanford alumnus) Stewart Brand. “In venues like the Trips Festival, the 
hippies of Haight- Ashbury sought to demónstrate the ability of technologies such as 
LSD, stereos, and stroboscopic lights to amplify human consciousness” (Turner 2006, 
178) by mixing those technologies into a single, transcendently synesthetic experience. 
Brand, in turn, was closely associated with techno -synesthetic experimentation at 
the Stanford Research Institute (SRI). Willis Harman, Quate’s colleague in the 
Stanford electrical engineering department and an SRI researcher, introduced Brand 
to LSD, and Brand subsequently connected SRI to the counterculture. Though SRLs 
classified contracts were a critical focus of reconversión protests at Stanford, many 
SRI researchers drew on countercultural aspirations in devising technologies for 
expanding consciousness to intégrate an ever-larger sensorium. 

At the time, for instance, SRI’s best-known research was its “Electronics and 
Bioengineering Laboratory’s” 1972 experiments on parapsychological synesthesia. 
Spoon bending, remóte viewing, telepathy, and ESP attempted to expand aware- 
ness until mental States could convert into physical actions (Kaiser 2011). Today 
SRI is most famous as the birthplace of the mouse and the graphical user interface 
(GUI), which debuted at the so-called mother of all demos in 1968, for which Brand 
served as videographer. Both those technologies originated in the same techno- 
synesthetic aspirations as the Buchla Box, Trips Festival, and acoustic microscopy. 
Thierry Bardini (2000) argües that their inventor, Douglas Engelbart, explicitly 
conceived them as synesthetic devices that would enable “augmentation of the 
human intellect” through feedback between proprioception (movement of the 
mouse) and visión (seeing the cursor on the screen). 

I have no evidence that Quate knew Engelbart (much less Buchla or Brand), 
but they were geographically, socially, and intellectually proximate. All three had 
múltiple connections to Stanford’s School of Engineering. More generally, the two 
organizations that defined Quate’s career — Stanford and Bell Labs — were places 
where reconversión and synesthetic conversión intertwined. Quate’s boss at Bell 
Labs, John Pierce, was one of the first prominent engineers to publicly criticize the 
military-industrial complex for “alienating engineering education from the civilian 
economy” (quoted in Leslie 1993, 252). He was also a staunch proponent of artist- 
engineer collaborations that fused múltiple senses into one synesthetic experience. 
Likewise, Quate’s Stanford department chair, John Linvill, outlined a clear program 
for reconversión two years before the campus erupted in protest — at the same time 
that he was developing the Optacon. 

While administrators like Pierce and Linvill outlined political and institutional 
reforms, engineers in Bay Area and Bell Labs workshops and laboratories focused 
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on new synesthetic technologies as a complementary or alternative path to reform. 
Most of these engineers were not politically active. Kompfner, Quate, Buchla, 
Engelbart, Chowning, Mathews, Meindl, Klüver, and others had all worked on Coid 
War technologies funded by defense agencies, usually with great pride. Yet some of 
this group also were attracted to a more humanistic, socially conscious engineering 
practice geared to “maximizing how much good I can do for mankind” (Engelbart, 
quoted in Bardini 2000, 8). Technologies for expanding awareness offered a less 
overtly politicized way to maximize the good they could do. As Fred Turner (2006, 
108) says of Engelbart, he “worked to create an environment in which individual 
engineers might see themselves as both elements and emblems of a collaborative 
system designed to amplify their individual skills.” Technologies for translating 
among visual, auditory, haptic, tactile, and proprioceptive sensations offered a way 
both to amplify individual skills and create collaborative Systems linking engineers 
to “representatives from different tribes — e.g. sociology, anthropology, psychology, 
history, economics, philosophy” (Engelbart, quoted in Bardini 2000, 16). 

Linvill may have seen projects like Engelbart’s as the intended outcome of the 
institutional reforms he proposed. The first two “representative areas” of research 
Linvill called for in his 1967 memo were a (a) education by teaching machines [and] 
(b) automation and cybernetics.” Engelbart’s whole program was a kind of educa- 
tional teaching machine, and he and Brand were closely associated with the nearby 
Portóla Institute’s work in instructional technologies (Turner 2006, 70). Both were 
also deeply influenced by cybernetics; like many, they saw it as a bridge from engi- 
neering to biomedicine, art, and music (Dunbar-Hester 2010). Likewise, Linvill and 
Quate were in cióse contact in shaping how acoustic microscopy could further 
LinvilEs institutional reforms. 

So acoustic microscopy resembled other synesthetic technologies at Stanford 
and Bell Labs in that it aimed to extend perception by fusing different senses, to 
connect engineering to more humanistic fields (i.e., the life Sciences), and to answer 
administrators’ calis for institutional reform. All of these synesthetic technologies, 
including acoustic microscopy, experienced some collapse or deflation of their 
original ideáis by the mid-i970s. The E.A.T. project dissolved due to lack of funding 
and a disastrous exhibit for the 1970 World Expo in Japan. Engelbart’s group 
imploded, and his most Creative engineers moved to Xerox PARC. Brand founded 
the Whole Earth Catalog to provide information to the commune movement, only 
to see the communes evapórate by 1973. 

A common theme of these unravelings was that new forms of liberated com- 
munication and synesthetic consciousness were too difficult to sustain — especially 
once less-challenging alternatives became available. For instance, Buchla’s Box, 
which required its operators to unlearn the entire framework of Western music, 
lost out to the mini-Moog synthesizer, which (with its preset patches and familiar 
keyboard) could be played like an ordinary piano. Similarly, Engelbart’s five-finger 
keyboard, which required learning an entirely new system for inputting informa- 
tion, lost out to the familiar QWERTY keyboard. The mouse and the graphical user 
interface, which Engelbart conceived as a means to challenge and stimulate users to 
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a new stage of “coevolution,” were co-opted by Apple for their unchallenging “user 
friendliness” (a concept Engelbart detested). 

Something similar befell acoustic microscopy. Quate’s original ideal of work- 
ing closely with biomedical researchers to better understand living cells became 
mired in life scientists’ reluctance to use an entirely new imaging technology and 
interpret an entirely new contrast mechanism — at least while more familiar imag- 
ing technologies sufficed. Thus, Quate turned away from biomedical applications 
and toward materials Science and microelectronics. There he found an audience 
that already understood how the acoustic microscope worked and what its images 
meant and who even wanted to build new microscopes for themselves. 

Quate’s return to audiences in engineering and the physical Sciences coincided 
with his palpable frustration with civilian agencies. He criticized RANN for its ten- 
dency to “revolution and abrupt changes in course” that demoralized NSF staff, 
and he rebuked the NIH for its inability to understand his difficulties collaborating 
with life scientists. 22 Defense agencies, meanwhile, showed great interest in acoustic 
microscopy and related technologies. Thus, Quate began work on an acoustic array 
imaging system that would allow the army to “see” at night and the navy to “see” 
under water (a macroscale equivalent of the array detector scheme he rejected for 
acoustic microscopy). By 1980, he and his colleagues had come to view synesthetic 
conversión as a way to signal their alignment with traditions in the military itself. 
As Tony Siegman, the director of the laboratory where Quate worked, asked: 

Have you guys ever thought seriously about heterodyning some of your rf [radio 
frequency] or microwave acoustic signáis down to audio frequencies, and letting 
some human ears simply listen to them? From what I’ve been told of the near- 
miraculous ability of human sonar operators to recognize, and pulí out of the 
noise, audible signáis which defy any kind of electronic signal processing, perhaps 
this facility could be usefully employed in acoustic microscopy or NDT also. 

[Now there’s a good basic research topic that should really be salable to ONR !]. 23 


Conclusión: Basic Science to 

Nanoscience 


Siegman’s mention of “basic research” is telling, for it had been the hete noire 
of reconversión proponents. As one pro-reconversion student put it in The 
Grindstone : 

[Sjome engineers point out that they do basic research and if someone else uses it 
for something else, it is not their fault. This is a school of engineering — of applied 
Science — not of physics. We are concerned with how to do things, not how the 
universe works. . . . The notion of “basic research” often acts as a smokescreen to 
hide what we are doing from others as well as to avoid facing the consequences 
ourselves . 24 
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The final phase of acoustic microscopy, however, took place in a postreconversion 
environment where engineers no longer faced such criticisms for doing basic 
research. 

The evolution of acoustic microscope design reflected that reality. In micro- 
electronics, acoustic microscopy was successfully applied to nondestructive inspec- 
tion of integrated circuits. Microscope manufacturers and microelectronics firms 
actively developed the technology for that application with only occasional input 
from Quate. In the Ufe Sciences, he accepted — perhaps grudgingly — that such 
applications would have to wait. Biomedical researchers provided him with samples 
but generally proved uninterested in acoustic micrographs of those samples. 
Though Quate initially justified acoustic microscope research on the basis of its 
routine use in clinical settings, biomedical researchers’ indifference meant he made 
little progress toward that goal. 

Quate therefore used their samples as test objects (Mody and Lynch 2010) to 
calíbrate high-resolution versions of the microscope for basic biophysical research 
rather than clinical application. Quate’s students of the early ’8os focused on simply 
driving the microscope toward higher resolution. Cells and bacteria contained 
small features with which to demónstrate acoustic microscopy’s improving resolu- 
tion, but Quate made little attempt to generate new knowledge of those samples 
(figure 9.2). Partly this was because high-resolution acoustic microscopes operated 
in liquid nitrogen or liquid helium rather than water. In those conditions, cells and 
tissues would die, and any images generated would be too esoteric to be useful to 
life scientists. As one reviewer put it: 

the main advantage of acoustic microscopy, besides the ability to see below the 
surface, is the ability to observe biological specimens in their natural environment, 
or inorganic specimens without the need for etching or special preparation. The 
steps required to improve resolution seem to all require temperature changes or 
altered fluid couplant properties that would remove this advantage. 23 

Quate’s research had become so basic that it had lost much of its relevance even to 
other basic researchers! 

Ironically, the idea for this Une of improvement aróse from Quate’s misunder- 
standing a biomedical colleague — the kind of misunderstanding that frustrated his 
original hopes for acoustic microscopy: 

“Gene Farber . . . said, c Why don’t you look at frozen tissues?’ So we built a 
microscope that would look at frozen tissues.” Actually, Farber was merely 
suggesting another use for the microscope. Tissue examined under an optical 
microscope is first frozen so that it can be cut. It is then placed on a slide where it 
warms to room temperature and stained to afford contrast for the viewer. [Says 
Quate,] “I thought they looked at it while it was frozen, so we build one at the 
liquid nitrogen level. Farber said, ‘What the hell for?’ But once you start looking 
at the properties of nitrogen, you soon realize it’s a better instrument, it’s a better 
resolving power.” (Jacobson 1984, 135) 

By 1982 Quate’s students were almost exclusively operating the microscope in 
liquid helium-3, at the coldest temperatures in the universe. 26 Achieving acoustic 
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Figure 9.2 Cryogenic acoustic microscope image of myxobacteria (left) compared 
with transmission electrón microscope image (right). The scale bar in both 
micrographs represen ts one micron (one-millionth of a meter). Reproduced with 
permission of University of California Press, from Edwin S. Boatman, 
Michael W. Berns, Robert J. Walter, and John S. Foster, “Today’s Microscopy: 
Recent Developments in Light and Acoustic Microscopy for Biologists,” 
Bioscience 37 (June 1987): 384-94. Permission conveyed through Copyright 

Clearance Center. 


microscopy’s theoretical máximum resolution replaced biomedical collaboration 
and discovery as the end goal of Quate’s research program. 

So when he read an article (Schwarzschild 1982) that April about an entirely 
new microscope that used tunneling electrons to probe a metal or semiconductor 
surface, Quate abruptly moved into the area. Scanning tunneling microscopy 
(STM) was, in many ways, a natural extensión of acoustic microscopy. Quate’s stu- 
dents ported significant pieces of acoustic microscope technology to STM. The 
means of scanning an STM probe over a surface, for instance, resembled those for 
scanning the acoustic lens. The circuits for controlling the sean and outputting the 
microscopes’ signáis to a visual format were closely related. Quate was also confi- 
dent that the STM could become a nondestructive testing tool for microelectronics 
manufacturing in the same way acoustic microscopy had. 

The social relationships built around acoustic microscopy were also critical to 
Quate’s success in STM. Indeed, Quate’s introduction to STM carne through an 
acoustic microscopy colleague — after reading that article, he obtained a meeting 
with the STM’s inventors through Eric Ash, a British acoustic microscopist. As it 
happened, IBM, which had sponsored Quate’s acoustic microscopy work, was also 
where the STM was invented — so Big Blue naturally funded Quate’s STM research 
as well. Several of the students and postdocs who worked on acoustic microscopy 
had left Stanford for IBM in the early ’8os; when these people learned their mentor 
had moved into tunneling microscopy, they followed. Moreover, once it became 
obvious that STM was not conducive to nondestructive testing, Quate convinced 
IBM to send one of the STM’s coinventors to spend a year at Stanford so they could 
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coinvent the atomic forcé microscope (AFM), a related instrument that can nonde- 
structively test microelectronic components (Mody 2004). 

Acoustic microscopy’s deepest influence on Quate, though, may have been that 
it offered a new way to think about the objects of research. Before acoustic micros- 
copy, Quate worked on phenomena (microwaves, surface acoustic waves) or areas 
of application (signáis processing, nondestructive testing). In the late Vos, however, 
Quate began arguing that his expertise lay in a size scale — the submicron regime — 
that was critical to national security and economic competitiveness: 

We propose here a program of generic research in the field of acoustic micros- 
copy . . . with a resolving power that approaches one-tenth of a micron. The 
realization of this goal will open new vistas in this field of “Microscience.” . . . 

In the special issue on “Microscience” ( Physics Today , November 1979) 

[and] ... a report from the National Research Council (“Microstructure 
Science, Engineering & Technology”) . . . the case is stated. . . . “Microstructure 
Science, engineering, and technology are essential to . . . continued economic 
well-being.” 27 

Expertise in “microscience” would not be pinned to any particular instrument but 
would range across all conceivable means of augmenting the senses: 

Unconventional methods for imaging in the microscopio world represent an 
emerging technology that permits us to go beyond what is possible with the 
optical or electrón microscope. These technologies are based on other forms of 
radiation such as acoustic waves, tunneling electrons, ions and X-rays. 28 

Quate and his associates have lived this philosophy to the fullest. For several years 
in the ’8os and Vos, for instance, his former postdoc, Kumar Wickramasinghe, was 
inventing a new microscope every six months. 

As “microscience” morphed into “nanotechnology” in the Vos, veterans of the 
Quate group were therefore well positioned as the leaders in the field. For instance, 
when President Clinton gave a speech announcing the formation of the National 
Nanotechnology Initiative, the backdrop behind him was an STM image made by 
one of Quate’s former acoustic microscopy students. 29 By focusing on size scale 
rather than phenomena or applications, Quate has retrieved some of the interdisci- 
plinary aspirations that were stymied in the 1970S. When Quate first contemplated 
the STM, for instance, he waxed enthusiastic that it would ensure that “someday we 
should return to cells.” 30 

Synesthesia, however, is largely missing from aspirations for nanotechnology. 
As I explain in the introduction, probe microscopists still listen to their data but 
with none of the utopian connotations of the late Vos. The moment when synes- 
thesia and reconversión were two sides of the same reformist coin passed quickly. 
For a few years around 1970, though, these senses of conversión were intertwined 
in ways that shaped how (and why) scientists and engineers generated new 
knowledge. If we consider the role of the senses in research — and particularly the 
ways laboratory workers move from one sense to another — then we need to be 
mindful of the historical specificity of these conversions. What is simply the natural 
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manipulation of data in one era can, in another, be the hook on which utopian 
reforms, unsettling convulsions, or appeals to new audiences are hung. 

NOTES 
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CHAPTER 10 


THE SEARCH FOR 
THE “KILLER 
APPLICATION”: 
DRAWING THE 
BOUNDARIES 
AROUND THE 
SONIFI CATION OF 
SCIENTIFIC DATA 


ALEXANDRA SUPPER 


Introduction: “The Ear Has Somehow 

Had a Bad Lobby” 


This chapter deais with the sonification of scientific data, frequently defined as “the 
use of nonspeech audio to convey information” (Kramer et al. 1997). While our ears 
interpret information all the time, as many of the contributions to this handbook 
show, the usage of sound to represent scientific data remains contested. My focus in 
this chapter is thus on the strategies that the practitioners of sonification utilize in 
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order to establish the legitimacy of sonification as a scientific method of data 
display. 

The last two decades have seen the establishment of the International Community 
for Auditory Display (ICAD), as well as an increasing popularity of sonification in 
general — elsewhere in this volume, Sterne and Akiyama go as far as declaring that 
we live in “an age of unprecedented sonification’ 5 (Chapter 23 , page 557 ) . Sonification 
has been applied to a wide variety of data and phenomena, ranging from seismo- 
graphic data to election results, from molecular structures to the electrical activity 
of the brain. The different sonifications not only represent a multitude of different 
kinds of data but also sound very diverse: While some sonifications lull the listeners 
with the sound of orchestral music, others might prompt them to dance to a techno 
beat, while yet others stay clear of any musical connotation and instead rely on 
abstract clicks reminiscent of a Geiger counter. A shared underlying assumption of 
these approaches has been that an auditory display and analysis of scientific dataseis 
might not only be helpful for scientists who have a visual impairment but might — 
often as a complement to existing modes of representation (e.g., statistical tables, 
verbal descriptions or visualizations) — also yield a more thorough comprehension 
of certain scientific data and phenomena. 

In the light of several centuries’ worth of scientific tradition dominated by the 
sense of visión (Wise 2006), this is a bold claim to make. A thought experiment 
proposed by Kenneth Gergen, in an essay that discusses the centrality of visual lan- 
guage for the rhetorical accomplishment of objectivity, underlines just how bold: 

The language of objectivity is primarily a language of visión. A typical research 
description in psychology, for example, will speak of subjects, questionnaires, 
tachistoscopes, chimpanzees, and so on . . . . If the subjects were described in 
terms of smell, questionnaires in terms of taste, tachistoscopes in terms of touch, 
and chimps in terms of sound, the descriptions would rapidly be discounted — 
merely the personal and subjective experiences of the investigator — potentially 
biased and unreplicable. (Gergen 1994, 277) 

Yet this chapter deais with a community of researchers who do attempt to describe 
and understand the world in terms of sound and who turn this thought experiment 
into a “breaching experiment” (Garfinkel 1967). Much like Garfinkel’s experiments 
have unveiled social norms that are usually taken for granted by violating them, 
sonification violates established customs of scientific data display and thus makes 
evident how much is taken as self-evident about what constitutes an acceptable way 
of representing scientific data. However, the practitioners of sonification are not 
(necessarily) doing this as an ethnomethodological experiment in epistemology; 
rather, they are looking to establish the legitimacy of sonification as a scientific 
technique. In this chapter I highlight the strategies they employ in order to accom- 
plish this balancing act of seeking acceptance for a scientific discipline that breaks 
with entrenched conventions of scientific analysis and display. 

To be sure, Science wasn’t silent before the sonification movement carne along 
and made it sound. In recent years, scholars of Science and technology (Lachmund 
1999; Sterne 2003; Mody 2005; Kursell 2008) have shown that sound is an 
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important element of scientific and medical practice and that the monopoly of 
visión has never been quite as complete as assumed by many who have focused 
exclusively on visual aspects. Indeed, some studies of visualization have themselves 
undermined the belief in the dominance of visión by acknowledging the role of 
other senses in the production of scientific knowledge. 1 

Some studies of sound in Science have investigated fields such as acoustics 
(Kahn 2002; Thompson 2002; Pantalony 2005), where sound is the very object of 
study. Others have given attention to the sounds of the laboratory in fields that are 
not explicitly dedicated to the study of sound (Schmidgen 2003; Mody 2005), point- 
ing out that sound a is pervasive in laboratory life and impinges on experimental 
experience in surprising and often epistemologically significant ways” (Mody 2005, 
193). In doing so, they have shown that Science, if you move upstream from its 
public representations to the places where it is actually produced and takes shape, 
turns out to be less silent than often assumed. If Science appears soundless, that is 
not because it never makes a sound but because the laboratory walls not only keep 
the unwanted sounds of the outside world out (Schmidgen 2008) but also trap the 
sounds of Science inside the lab, out of earshot of the public. 

In this chapter I cross over from the confines of the laboratory to the confer- 
ence hall. Whereas the studies discussed earlier have looked at sound as an object of 
scientific inquiry or at its importance for the daily practices of scientists, I study a 
community that attempts to emancípate the sounds from their hidden existence 
inside the black box of Science by using them as a tool in the analysis and represen- 
tation of scientific data. In doing so, these researchers have to face conventions of 
scientific data display that are not exactly sympathetic to auditory representation — 
as one of my interviewees succinctly put it, “The ear has somehow had a bad lobby 
for a very long time” (FD, 3). 2 

The sonification researchers therefore have some crucial lobbying work to do 
for the ear. In this chapter I analyze the different strategies with which they attempt 
to make listening to data a legitímate way of dealing with scientific information. 
I understand these activities as “boundary work”: In order to position their work 
and infuse it with legitimacy, these researchers engage in negotiations of the bound- 
aries of their field, attributing it with certain qualities to establish its cultural author- 
ity and demarcating it from other endeavors. The concept of boundary work has 
been used to study the demarcation work with which Science is distinguished 
from its “others,” such as politics, engineering, and religión (notably by Gieryn 
1995), as well as at the internal boundaries of Science, those between disciplines 
(Amsterdamska 2005; Burri 2008). This chapter is concerned with both, investigat- 
ing how the boundary between Science and nonscience (notably art) is negotiated 
within the domain of sonification, as well as its positioning vis-á-vis other scientific 
disciplines. 

I look at these processes from four different angles: In the first section, which 
focuses on the establishment of a core sonification community, I study debates 
about how to best define the field, as well as their implications for who may (or may 
not) legitimately speak for sonification. Subsequently, I discuss the community 5 s 
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search for a “killer application” and how expectations shape the community — 
which means following these expectations as they branch out from the community 
into commercial, academic, artistic, or popular spheres. I then zoom in on one 
particular borderline where (albeit peaceful) boundary conflicts loom large in the 
domain of sonification: that between Science and art. Returning to (relatively) firm 
disciplinary academic ground in the last section, I get to the bottom of notions of 
“quality” and strategies for assessing it. 

This chapter is based on empirical research conducted in 2008 and 2009. The 
methods range from the conduct of twenty-two semistructured qualitative inter- 
views (listed in the appendix) with practitioners of sonification, mostly from within 
the International Community for Auditory Display (ICAD); a discourse analysis of 
ICAD conference proceedings and related journal articles, books, and dissertations; 
and a few short-term ethnographic studies. These episodes of participant observa- 
tion inelude two weeks of self-experimentation in learning sonification while based 
in an institution that is active in the field, as well as attendance at two ICAD confer- 
ences, two sonification workshops, and several sonification-related concerts and 
talks. 


“Sonification Is Defined As . . .” 


The first ICAD conference was held in 1992 at the Santa Fe Institute in an effort to 
pool the previously scattered activities of various researchers working on issues 
related to sonification and auditory displays. Despite being labeled an “interna- 
tional conference,” the character of this gathering, with its thirty-six (predomi- 
nantly American) attendees, was closer to a workshop and a rather personable one 
at that: The participants’ ñames are printed on the back of a T-shirt commemorat- 
ing the event, and the proceedings volume (Kramer 1994) comes with a group pie- 
ture. Gregory Kramer, conference organizer and founder, explains that, even prior 
to the first conference, it was never intended as a one-time congregation; he recalls 
thinking the following: 

This is a field, this could be, this should be, this will be, a field. ... So calling it 
something as pompous as the International Conference on Auditory Display, 
when really it was a handful of researchers doing something, a handful of 
researchers not doing anything, and a handful of people who were kind of 
interested maybe — you know, it’s a little bit of a reach, but I trusted it. (GK, 3) 

This desire to establish a new field along with the conference is noticeable in the 
book publication, which contains not just a collection of conference papers and an 
accompanying CD but also a lengthy, historically informed background paper on 
auditory displays by Kramer and an annotated list of resources and publications. 
Throughout the book, it is acknowledged that a systematic and coherent terminol- 
ogy for the field has yet to be provided — see, for example, Kramer’s own preface 
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(Kramer 1994, xxvi) or the contribution by Scaletti (1994, 224). Yet the book does 
propose some “working definitions” (Scaletti 1994, 224), which continué to be an 
important resource for the field today. 

The next big step in codifying sonification was a report for the U.S. National 
Science Foundation. Prepared at a preconference workshop at the third ICAD (held 
in Palo Alto in 1997), the report provided some stock taking, as well as recommen- 
dations for a research agenda, and included what remains one of the most-cited 
definitions of sonification today: “Sonification is defined as the use of nonspeech 
audio to convey information. More specifically, sonification is the transformation of 
data relations into perceived relations in an acoustic signalfor the purposes offacilitat- 
ing communication or interpretation” (Kramer et al. 1997; original emphasis). 

Several characteristics stand out in this definition: It specifies a process of trans- 
formation (something which wasn’t already audible is turned into sound), it 
emphasizes the relational character (it’s about expressing relations, not precise 
numerical valúes), and it indicates the purpose of sonification (communication or 
interpretation). The most explicit exclusión is that it categorically exeludes speech 
displays; the other elements of the definition appear more elastic. 

Compare this to a newer effort at defining sonification, proposed by Thomas 
Hermann at ICAD 2008. Hermann suggests that “a technique that uses data as 
input, and generates sound signáis . . . may be called sonification, if and only if’ 
(Hermann 2008, 2) a certain number of other criteria are met. Hermann’s criteria 
specify that the sound refleets objective relations or properties in the input data, that 
the transformation is systematic , that it is reproducible> and that the same procedure 
is applicable to different data. Unlike in the previously quoted definition, then, the 
systematicity and reproducibility of sonifications are made explicit here: It’s 
not enough that the sound tells us something about underlying data relations; that 
relationship must also be thoroughly systematic, and the output sound has to 
be the same, not merely similar, when the process is repeated. In other words, what 
we are witnessing is the establishment of a set of procedures to reduce subjective 
intervention in an appeal to “mechanical objectivity” (Daston and Galison 2007). 

This new definition involves the drawing of a sharper, less permeable boundary 
between “scientific sonification” and “artistic sonification,” or rather, it denies the 
existence of the latter category altogether and therefore declares the distinction 
redundant: “Being a scientific method, a prefix like in ‘scientific sonification’ is not 
necessary” (Hermann 2008, 3). Defining sonification in such a way also means to 
declare it as the jurisdiction — in Andrew Abbott’s (1988) sense — of scientists, in 
particular, those scientists who have an intímate knowledge of the conventions of 
sonification research. While contributions by artists and composers are not ruled 
out categorically, they at least have to submit to an explicitly scientific logic. 

This is not an insignificant point: Although the term sonification is not usually 
used by these artists, there is a rich tradition, especially within the domains of 
contemporary classical music and sound art, of transforming data into sound for 
musical purposes (Schoon and Dombois 2009). On the other hand, there are also 
scientists who, without any contact with the sonification community, use sound as 
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a resource for popularization activities. For example, when giving popular talks, 
some asteroseismologists play “the sound of a star” in order to convey an idea of the 
stellar oscillations that they research. These scientists might regard these sounds as 
useful tools for popularization, but they are generally quite skeptical about the sci- 
entific accuracy and reliability of auditory displays and see no need to pay too much 
attention to the specifics of how these sounds were generated (CA, 7; JH, 5; DK, 14). 
For their purposes, such minutiae are of no importance: “Im interested in the 
physics of these stars. I use the sounds to understand that. . . . But it’s not a goal by 
itself 5 (CA, 6). 

In other words, the researchers gathered at ICAD are not the only group of 
practitioners who transform data into sound. Proposals of a more rigid, scientific 
definition of sonification also serve to establish the ability to produce something 
that would qualify as a sonification as the exclusive competence of the core sonifica- 
tion community. The definition thus has implications not just for what counts as a 
sonification but also for who is considered qualified to make one. 

It is perhaps no coincidence that such definitional efforts occur at the same 
time as a general trend to formalize the community. Kramer’s (1994) volume is still 
the closest thing the field has to a handbook; there is no journal explicitly dedicated 
to sonification, while the annual ICAD conference proceedings serve as the most 
important outlet for publications; there are no chairs or dedicated university posi- 
tions in sonification; it is still difficult to convince funding agencies of the merits of 
sonification research. Yet the last few years have seen increasing efforts to push for 
an institutionalization of the field: The structure of ICAD is becoming more formal 
(BW, 7), and a new handbook and a journal are in the works (BW, 7; TH2, if.). 
According to Bruce Walker, ICAD president, this means that the next few years will 
see an increase in definitional efforts: 

Now, as soon as you have a journal . . . you have to have an editorial statement 
and a mission statement for the journal. And that formalizes exactly what that 
journal is about. Now if that journal is the flagship journal for our community, 
then that will have the effect of kind of defining what we consider to be representative 
research, or representative papers. ... So you will see more and more explicit 
definition of the field in the next two or three years. (BW, 7) 

Clearly, debates over what sonification is (and is not) attract considerable interest 
within the ICAD community: After Thomas Hermann’s 2008 talk proposing a new 
definition of sonification, the discussion went far beyond the few minutes reserved 
for responses at the end of the presentation and reached into various coffee breaks 
and dinner conversations. What was at stake was not so much the specific criteria 
Hermann proposed but rather the question of whether it is a good idea in the 
first place to try to narrow down the boundaries of the field. Some have argued 
that any definition should be flexible and fluid enough to accommodate future 
developments of the community rather than intervening in favor of any one par- 
ticular conception of the community and its techniques. Stephen Barrass reasons 
that “the suggestion that sonification should be more scientifically rigorous is 
misguided,” 3 as it makes sense for particular approaches to sonification that exist 
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within the field but not for other traditions that are informed by other kinds of 
(e.g., humanist) theories. To some, this might be an issue of principie; to others, 
one of timing: As one commentator at the ICAD 2008 session remarked, it was “too 
early” to restrict the boundaries of sonification, as only very few really convincing 
examples have yet been found. In the light of this situation, argued the commenta- 
tor, it would be a mistake to narrow down the definition now. If an artist were to 
develop a really convincing auditory display that would effectively convey some- 
thing about a dataset without fulfilling all of Hermann’s criteria, it would be a loss 
for the field if this example could not count as a sonification. With these consider- 
ations, we are already addressing the issues that are the central topic of the next 
section: the capacity of expectations for the future to shape a community. 


“That Might Be a Fairly Significant 

Killer Application” 


The role of expectations becomes most manifest in the community’s discussion 
about finding a “killer application” for sonification — precisely because, it seems 
widely agreed upon, this killer app still has to be found. 4 It seems significant that it 
is an “application” we are talking about here: Sonification deais in applied knowl- 
edge and is on the lookout for a Service that could extend past the core community 
into commercial, scientific, artistic, or popular domains. Certain expectations are 
created as a resource for legitimizing research and mobilizing for funds, publicity, 
and support of other actors — all of which are functions of expectations discussed 
by Van Lente (1993) in an STS study on the dynamics of expectations in techno- 
logical developments. Expectations are also drawn upon and created by actors in 
specific research communities in order to justify and push their field (Guice 1999) 
or a particular visión of their field (Hedgecoe and Martin 2003). It is this last aspect 
of expectations that I focus on here: How are expectations for sonification appro- 
priated to shape a particular conception and visión of the community? 

The term killer application is borrowed from informatics, where it “refers to an 
application program so useful that users are willing to buy the hardware it runs on, 
just to have that program” (Juolo 2008, 76). Juolo gives some historical examples of 
killer apps (e.g., early spreadsheet software, which showed such promise for book- 
keeping and decisión making that businesses bought computers just to be able to 
run these applications). Similarly, a videogame would be considered a killer app if 
it were so popular that thousands of gamers would buy a new consolé just to be able 
to play it. 

For sonification, the term is used in a metaphorical sense: A sonification would 
be considered a killer app if it were so convincing that it would make people “buy 
into” the idea of sonification in general, contributing to its acceptance — not if it 
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made practitioners run out by the million to buy new soundcards or speakers so 
they could hear the sounds better. Indeed, in many cases it will make sense to keep 
the cost of entry low by avoiding the need for new technical gadgets: For many sci- 
entists, engaging with sonification already requires a leap of faith, so it doesn’t help 
to expect additional expenditures (e.g., new hardware or programming skills) from 
them. Consequently, Florian Grond, who works on a sonification platform for 
molecular structures and dynamics, realized after talking to a number of chemists 
that it would be better to provide this sonification utility in the form of a plug-in for 
a widely used, open-source software package for data visualization rather than to 
expect them to learn an entirely new programming language. This means to link the 
sonification application not only to existing tools but also to established modes of 
representing data and the skills required to work with them. Grond acknowledges 
that the chemists have a point in insisting on a combination with visual displays: 

And that’s quite obvious, actually — because they have acquired expertise in 

fworking with visualization], and if you told them now: “as of tomorrow, only 

listening and no more seeing,” then they would throw away the expertise which 

they have acquired over many years. I wouldn’t accept having to do that, either. 

(FGi, 11) 

Here Grond illustrates a lesson that has been pointed out by scholars of Science 
and technology studies: If a (research) technology is to succeed, it has to be adapted 
to and embedded into existing cultural practices in order to be appropriated into 
specific contexts of use (Bijsterveld 2004; Borck 2006). If the new technique is too 
foreign, too far removed from scientists’ daily practices, they will be reluctant to 
give it a chance. This is perhaps one of the reasons that many sonification research- 
ers focus their hopes of developing convincing displays on medical applications: 
Not only is there the poten ti al of coming up with Solutions that might pro ve ben- 
eficial to public health (EB, 8; TS, 14), but many perceive that they already have a 
foot in the door in the medical field (GBi, 1; TH2, 12), where existing listening prac- 
tices have helped to prepare medical practitioners for the possibilities of auditory 
displays. While many sonification researchers complain about the lack of an educa- 
tion in listening skills, the medical field is one domain where students do learn how 
to listen (see Rice, this volume). 

Thus, even though sonification breaks with some of the entrenched conven- 
tions of scientific data display, its success will depend on the ability to tie in with 
existing practices and skills. Of course, it also has to offer some clear benefits over 
existing approaches to data analysis and representation — it must carve out its own 
niche. The exact contours of such a niche are still unclear, as the demands are at 
times contradictory: For example, in order to be of use to users who have a visual 
impairment, who need to compénsate for their lack of visión, a sonification 
might be at its best if it conveys something that a visual display is good at (TS, 14); 
to a person whose sight is intact, it might be more useful if it compensates for 
weaknesses in visual displays (MG, 13; TH2, 11). Different user groups might also 
have very different demands for accuracy versus aesthetic appeal (BW, 2). Another 
question is whether the sonification should be directed at a small community of 
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specialists or appeal to a popular movement; while many projects have gone for the 
first strategy, there are also attempts to anchor sonification more firmly in popular 
culture (e.g., in social networking applications) (SB, 3). Others argüe that “the 
only way to succeed will be to carve out a prototype or a convincing example and 
to demónstrate it on one disease, one symptom, one little córner where you can 
convince specialists” (GB2, 2). 

It seems widely believed that most of the sonification work being done so far 
consists in “some very interesting prototypes” (EB, 3) but that there is very little 
in the way of generalizable knowledge, guidelines, or best-practice models — 
which arguably becomes more of a problem precisely as the community expands 
(EB, 3). Acknowledging that their field is still in its infancy, many practitioners of 
sonification rely on promises for the fu ture to justify their work. Talk about “killer 
applications” seems to occur in the future tense. 

Yet it is not projected into the very distant future; it could happen any day. One 
of my interviewees relates the following: 

And funnily enough, a fortnight ago, on Radio 4, a very mainstream BBC 
radio station, there was some physicists who had some sonifications of activity 

going on internally within stars You know, I thought that might be a fairly 

significant killer application, it certainly got quite a bit of publicity. (TS, 15) 

However, when he contacted one of the scientists involved, the reaction was quite 
disappointing: The researcher replied that the sound display was intended as a pub- 
licity gag, not as a serious research component. Not much of an ally in the search 
for the killer app. 

Yet this readiness to accept an example ffom outside of the ICAD community 
as a possible killer application seems symptomatic. We have witnessed this before 
in the previous section, when a commentator at ICAD warned against narrowing 
down the boundaries of the field, as this might result in excluding some very con- 
vincing sonification examples. To some extent, this openness is built into and 
explicitly fostered in the community — for example, with sonification concerts 
organized in the context of the conferences to stimulate involvement by composers 
or with the invitation of keynote speakers who are often not “the kind of person 
who would come to ICAD unless they were invited or unless the conference was in 
their city” and who can thus be introduced “to a field that they may not even realize 
is related to their work” (BW, 6). This openness is partly perceived as being in 
“the nature of the field” (GK, 4), but it’s also explained by the potential of learn- 
ing from each other (BW, 6), as well as a strategic choice to keep the area broader 
in order to build momentum, to get some “critical mass” (GK, 4). As we have 
already seen, this reasoning is rooted in expectations for the development of the 
community in the search for a killer app. Expectations thus not only guide the 
direction of research conducted in the field by favoring some types of research 
over others but are also used as a resource in shaping a conception of sonification 
that emphasizes openness, broadness, and self-reflexivity as characteristics of the 
community. 
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“What ‘Art’ Means for the Art System 
and What ‘Science’ Means for the 

Science System” 


Drawing the boundaries of sonification often involves questions about the relation- 
ship between Science and art, and I address these questions in more detail in this 
section. We have already seen how attempts to define sonification as a scientific 
(objective, reproducible) procedure constituted an effort to draw a sharper bound- 
ary between sonification as a scientific application and musical approaches of 
making data sound. However, we have also seen that these efforts of boundary work 
have not remained undisputed, as many consider the openness of the community 
an asset in the search for the “killer app.” 

This dispute might be interpreted as a struggle between different factions within 
ICAD, one in favor of doing only the most straightforwardly “scientific” sonifica- 
tions and cutting the ties to the artistic community, the other stressing the valué of 
artistic contributions and the need for openness. However, this interpretation 
would be too facile. It would be hard to reconcile, for example, with the perfor- 
mance of Thomas Hermann, who would have to be considered a spokesperson of 
this ostensible “scientific faction,” at the 2008 Wien Modern festival for contempo- 
rary music, presenting — with Gerold Baier — a “live sonification of the human 
EEG,” using the brainwaves of composer Alvin Lucier. 

Rather than an issue of stable splinter groups within the sonification commu- 
nity trying to enforce their interests and impose their idea of how scientific/artistic 
sonification should be, I show that these boundaries are negotiated in context- 
specific ways. Take the example of a sonification talk given at a neurological 
workshop, reported by Gerold Baier: When he and his colleague presented their 
EEG sonification, one of the “big ñames” in neurology demonstratively left the 
lecture room and was soon followed by a number of other researchers. Even those 
who remained in the hall for the remainder of the lecture displayed reactions that 
Baier likened to those to free jazz performances during its prime: open mouths, 
signs of disapproval, confusión about what they were being confronted with. Those 
who were willing to engage with the presentation at all insisted on addressing it on 
musical rather than scientific terms — to the dismay of the presenters, who were 
there to discuss their results with epileptic data, not elements of composition. Yet, 
however undesired it may have been by the presenters, the association of their talk 
with music was not a complete accident, as the title of their talk (mentioning an 
“unpredictable concert”) itself elicited musical connotations. When asked about 
this apparent contradiction — not wanting a paper to be associated with music, yet 
invoking music in its very title — my informant explained that the title was not 
chosen entirely voluntarily; the musical reference was proposed by the organizers 
and was for the presenters the label via which the material could be presented at the 
workshop at all (GBi, 1). 
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This is an interesting case because it combines two aspects of boundary work 
that are often discussed separately: Scholars of boundary work often focus exclu- 
sively on rhetorical demarcations of a field (Gieryn 1995), or they interpret interdis- 
ciplinarity in terms of a Crossing of boundaries (Klein 1996). Here, however, we see 
that the two go hand in hand; Demarcation and boundary Crossing occur simulta- 
neously. As Willem Halffman points out, “boundary work has the double nature of 
dividing and coordinating” (Halffman 2003, 7°)> of demarcating as well as specify- 
ing conditions under which demarcations can be crossed. In the case of the earlier 
anecdote, it was the very promise of Crossing the boundaries between Science and 
music that opened up possibilities of presenting in this scientific venue in the first 
place. However, the association with music was not desired by the presenters them- 
selves, who were quick to rhetorically demárcate themselves from being positioned 
on the artistic end of the science-art spectrum. While the researchers accepted and 
went along with the musical label to some extent (they did not turn down the offer 
of presenting at the workshop) because it provided opportunities for them, in the 
next step they tried to subvert this categorization. 

This shows that the position of sonification in relation to Science and/or art is 
not a given but becomes a balancing act that is dictated as much by strategic deci- 
sions and the desire for an audience as it is by the penchants of the practitioners. In 
other words, the categories of “science” and “art” do not refer to distinctly bounded 
domains; the distinction becomes permeable and mutable: subject to boundary 
work. 

Such negotiations about the role of art (specifically, music) for sonification 
become visible in debates within the community about finding a language that 
allows for communication across different disciplinary backgrounds — an issue 
that has also been given attention in STS studies of collaboration, notably in 
Galison’s (1997) discussion of pidgin and creóle languages. While a pidgin refers to 
a contact language developed by native speakers of different languages in order to 
be able to communicate across specializations on delimited issues, a creóle is more 
comprehensive and can be taken up as a native language for a new generation of 
researchers. 

According to many in the community, sonification lacks such a comprehensive 
common language, although there seems to be a shared understanding that it will 
become necessary to develop one. Such an emerging language for sonification is 
considered to have “patchwork” (THi, 7) qualities and should be suitable for ana- 
lytic rather than emotional discussions of sound and data — a vocabulary that, so 
far, is considered to be largely absent (GB2, 5). In some instances, this missing 
vocabulary can be replaced by terminology borrowed from visualization — a prac- 
tice that is eyed suspiciously by parts of the community, as evidenced by debates at 
ICAD 2009 about the need to break away from the appropriation of visual language 
and develop a language for sound. 

To what extent does musical discourse come into play in developing such a 
language? A basic literacy in musical vocabulary, my interviewees seem to agree 
(TS, 11; EB, 4), is helpful for working with sonification. This does not mean that 
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musicological terminology itself is a sufficient foundation for talking about sound, 
“especially as it is made for a different context. It is made, for instance, for a certain 
culturally grown concept of harmonio progressions” (GB2, 5), and using musical 
vocabulary therefore means dragging along the entire cultural context it sprang 
from. Indeed, some even regard a traditional musical training with very strict con- 
ventional ideas of harmonics and rhythmicity as a potential constraint for doing 
sonification (AdC, 12). 

Terminology is just one of the things that sonification could learn from music. 
Musicians and composers — especially those who come from a contemporary and 
experimental musical tradition (AdC, 12; GB2, 5) — are also thought to be able to 
bring other assets to the table, such as the ability to listen for structures (GB2, 2), 
certain technological tools (GK, 12), the freedom to choose one’s format of publi- 
cation (FD, 3) and to experiment with ideas that are not a guaranteed success 
(AdC, 11), the diversity of different realizations of one idea (SB, 5), aesthetic consid- 
erations (GB2, 6; FD, 11; GK, 12), and the familiarity with a broad spectrum of 
different kinds of sounds and knowledge of the different possibilities they allow 
(AdC, 12). Besides, in a field that is chronically underfunded and barely established 
in the scientific domain, the possibility of Crossing over into artistic territory opens 
up opportunities for funding and publicity. In the light of these considerations, it is 
no surprise that sonification frequently branches out into artistic contexts or invites 
participation from musicians (e.g., with the organization of concerts in the course 
of ICAD conferences). Clearly, a lot of boundary Crossing is going on here. 

However, boundary Crossing goes hand in hand with demarcation, and the 
practitioners of sonification are often quite adamant about drawing a boundary 
between sonification and music. Thus, at a talk given in the context of a festival for 
contemporary music, one sonification researcher greeted his audience with these 
words: “I am glad to see that you dared to approach this topic even though it has 
nothing to do with music.” Laughter ensued, but a message had been sent: 
Sonification does not equal music — even if its practitioners might take advantage 
of an opportunity to present their work in the context of a music festival. A similar 
rhetoric of demarcation can be found in quotes from my interviews: One intervie- 
wee emphasized that sonification — just like visualization — allows some aesthetic 
freedom but is “actually a technical procedure . . . it’s rule bound. And a visualiza- 
tion is not yet a painting. And likewise, a sonification cannot automatically be com- 
pared to a musical composition” (FD, 3). While participants with musical 
backgrounds are numerous and welcome at ICAD, “if it’s purely art or purely 
music, then it belongs in a different conference” (BW, 14). “People might still ben- 
efit from it; as a matter of fact in all likelihood [they] would if it’s interesting stuff. 
But you wouldn’t just take it in if it’s a good piece of music and say it should be at 
ICAD” (GK, 13). 

The question of where to draw the boundary also has to be faced by individual 
practitioners (e.g., in choosing appropriate contexts in which to present one’s 
work). After all, the crossover into musical territory can secure funding and public- 
ity but has some drawbacks as well. When I asked the media artist and researcher 
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Florian Grond whether he would make an art project out of his dissertation, he 
replied that he might consider it but would be cc very cautious”: 

If you approach chemists about sonification of chemistry, at first they always 
believe that it’s art anyway. And then it’s counterproductive, of course, if you end 
up using it as art because then it’s just difficult to communicate why one thing 
was not art and the other thing suddenly is art. (FGi, 4) 

Grond worries about muddling the message for the chemists he wants to collabo- 
rate with, but he is also concerned about communicating a clear message to funding 
agencies — after all, “y° u just have to be realistic: Being part of a system, with all 
the restrictions that system entails, simply also means that you can put things into 
practice because the system just distributes money in the best case” (FGi, 5). He is 
reluctant to tap into the kind of resources that might become available by present- 
ing sonification as an art project, as this often “has a very, very decorative function 
that is mostly on the level of infotainment and actually of a justification for all 
the money being spent on these [large-scale research] projects. It’s about public 
relations and all that” (FGi, 6). The demarcation here is not an essentialist distinc- 
tion between Science and art but a pragmatic realization that “by and by, you learn 
to detect what ‘art’ means for the art system and what ‘science’ means for the 
Science system” (FGi, 2). To succeed in either one, you have to learn the rules of 
the respective system. 


“Evaluating Ourselves to Death”? 


“Bam-bam-bam-bam-bam.” Over the phone, Bruce Walker sings me an auditory 
graph with rising pitch. I have asked him to ñame an example he likes to draw upon 
in order to convince his fellow psychologists of the valué of sonification-related 
research. Walker’s example is more than just the auditory graph itself: After letting 
them hear the graph and explaining that the sound represents money, he would ask 
his colleagues whether he was getting richer or poorer. Almost all of them would 
answer that, as the pitch goes up, the money increases. Walker would then tell them 
about a study in which hundreds of people, both blind and sighted, were asked this 
question. While the sighted respondents shared the interpretation of Walker’s col- 
leagues, the blind respondents thought the opposite. In the course of the research, 
it turned out that different mental models are at play: 

If I took a coin, and I dropped on a table, it would go <£ clink.” If I took a roll of 
coins and dropped them on the table, it would go “clonk.” And if I took a bag of 
coins and I dropped it on the table, it would go “thud.” And for [the blind 
people], more coins, more money, more — volumetrically more of anything, 
makes a lower-pitch sound. (BW, 19) 

Walker uses this anecdote to demónstrate to his psychologist colleagues that research 
on auditory perception yields interesting and unexpected results. Yet debates about 
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the role of testing auditory displays are relevant not just in the psychology commu- 
nity but also within ICAD itself. Presumably few would deny the valué of the 
research in auditory perception that Walker described; however, what is at stake is 
whether such research should be included with every sonification publication. 
Traditionally, this has not been the case; after the first ICAD conference, Gregory 
Kramer summed up as follows: 

Auditory display is a young field. As in any field, it takes time to establish a solid 
foundation of rigorous research. . . . Few of the papers at ICAD included the 
actual running of subjects along with the experimental design necessary for a 
more rigorous proof of display effectiveness. (Kramer 1994, xxix) 

Recent literature studies of ICAD proceedings indicate that “there has been a trend 
toward increasing usage of [user or participant data] testing” (Bonebright and 
Miner 2005, 518). In Frauenberger’s (2009) survey of the publications in the pro- 
ceedings of ICAD 2007, 5 all but four of twenty-three papers included some sort of 
evaluation of the designs — but that ineludes subjective evaluation by the author or 
pilot studies promising further research, as well as the “experimental design” that 
Kramer had hoped for. One of my interviewees explained why he considered it 
important to have more quantitative user tests: 

You need some way to measure what you actually achieve when you’re using 
sonification. It’s not enough that you say this, listen, this really sounds better 
than yesterday. That’s not the result. But if you can show that, when you have ten 
people doing this task, they do things 10 percent better when they’ re using the 
auditory display than when they’re not using the auditory display — that’s a result. 
(MG, 15) 

Another respondent referred to such work as having “research components” as 
opposed to doing “show and tell”: 

I always encourage papers at the conference that have evaluation components, 
that have research components, that are introspective or self-reflective. They 
look at what they’ ve done, and they try to assess it and evalúate it and figure out, 
place it within some theoretical context. As opposed to the kind of papers where 
people are basically doing show and tell. . . . There will always be plenty of 
applications, but I think we always need to push for more of the Science and 
evaluations. (BW, 3) 

It’s interesting to see what counts as a “research component” here: It has to do 
with placing one’s work in a theoretical cadre and relating it the work of others, 
but it is also strongly associated with evaluation and experimental setups. Both 
respondents quoted earlier profess to exert (gentle) pressure for such “research 
components” when writing peer reviews for conference papers. This practice, how- 
ever, is contested — as are the underlying ideas of what constitutes research. For 
instance, one respondent complained about the peer review practices at a previous 
ICAD conference: 

Many of the best sonification examples were curated out, peer-reviewed away. . . . 
There were a central stream and the póster sessions, and [many] good things 
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were sent into the póster sessions because [the reviewers] had abstruse ideas 
about evaluability and intersubjectivity. So they said, if somebody makes a sound 
and did not make a series of user tests with 17, at least, I don’t know, 17, 20 test 
persons, then we cannot accept this because that’s not scientific. It’s as if you 
would not have a graph printed if someone cannot prove that he let 17 people 
look at the graph to make sure they can see something in the graph. That is, 

I think, that’s absurd. (FD, 13) 

The comparison with visualization in discussing this phenomenon was made by 
another interviewee as well: 

The first visualizations of molecules were not evaluated; they were just made. 

And they were extremely functional. Certainly, there was some thought behind 
it and not just daubing something on paper, sure, but that’s what I mean: 

Something was done; it was extremely functional. It is — I’m not sure if the 
medial representation, if you can cali this a scientific method, but it is at the very 
least a valid scientific tool without having gone through an evaluation process. 

(FGi, 7) 

This interviewee related the insistence on evaluations to a lack of involvement by 
the application Sciences. In doing so, he took up a recurring theme in my inter- 
views: the importance of intensive collaboration with specialists in the domains 
from which data are being sonified (EB, 7; THi, 9), as well as the difficulties of find- 
ing such specialists who are willing to cooperate (FD, 17; GK, 8, 11). According to my 
respondent, the internal audience of ICAD attempts to compénsate for this short- 
coming by being extremely critical, which results in a danger of “evaluating oneself 
to death” (FGi, 7). 

The difference between the perspectives discussed earlier partially lies in a 
different emphasis on what constitutes “research.” The diversity of different back- 
grounds in ICAD manifests itself in the different motivations brought to the table: 
Depending on whether one’s main research interest is to further understanding of 
human listening processes or to explore a particular dataset, the requirements of 
empirical testing might be dissimilar; testing for “display effectiveness” might not 
mean the same thing in both instances. Not everyone even aspires to the status of 
doing Science to the same extent. After stressing that scientific strands within soni- 
fication indeed require testable hypotheses, Stephen Barrass States that the same is 
not true for all sonification work: 

In my view, sonification is a field of applied Science and design. Design is a 
problem-solving process rather than a theory-proving process. It is multifaceted 
and specifically situated. It recognizes múltiple Solutions to the same problem and 
that the designer, context, and audience are critical elements. It is iterative and 
requires evaluations at incremental decisión points. . . . Rather than a scientific 
experiment with large numbers of subjects, the designer needs to rapidly test 
alternatives with just a few subjects. (Barrass, personal email correspondence, 

July 10, 2009) 

With such problems in mind, members of the ICAD community are develop- 
ing alternative techniques of evaluations often based on qualitative methods. 
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Barrass’s work toward a “sonification design meshodology” (not a methodology 
but c< a mesh of methods”) is an example, as is recent work by Brazil and Fernstróm 
(2009). 

So, to some extent, the different takes on user testing can be ascribed to differ- 
ent disciplinary backgrounds, motivations, and notions of Ct research.” They can 
also be regarded as diverse types of boundary work, that is, as different approaches 
to the legitimization of sonification as a viable technique. This becomes most 
explicit when Dombois argües that the insistence on user testing not only debases 
the papers that inelude no such evaluations but also undermines the accomplish- 
ments of the community as a whole: “It’s like giving up something you have just 
gained, which is saying: Listening is an authority” (FD, 13). 

The argument rests upon the comparison to visualization: Sonification is just 
like visualization (in this respect), and visualization is widely used without requir- 
ing user tests, so sonification should be self-confident enough to do the same or risk 
undermining its own accomplishment of establishing listening as a trustworthy 
technique. The reference to visualization is thus used as a rhetorical resource 
for boundary work. Sonification, like medical auscultation in Jens Lachmund’s 
study, is not a purely auditory phenomenon but “an auditory [one] that, at least 
partly, fis] being shaped through the visual” (Lachmund 1999, 428). Any look at 
sonification papers or conference slides can confirm this: No less than in other dis- 
ciplines, they are replete with pictures that range from visualized sound waves to 
diagrams and photos. (The accompaniment by sound files is less widespread, 
though often requested.) If the usage of graphical techniques to transcribe sound 
into images played a fundamental role in the establishment of acoustics as a Science 
(Brain 2002; Kahn 2002), such visualization techniques continué to play an impor- 
tant role for sonification today. Moreover, the preceding analysis shows that visual- 
ization not only helps to illustrate sonification research but also becomes a tool 
for rhetorical demarcation. Visualization is presented as a competitor, as well as an 
ally of sonification research, and is invoked in order to make claims of scientific 
authority. 

The approach favoring user tests, too, is involved with a quest for scientific 
credibility: Listening is an authority, and we’re not just going to claim it; we’re 
going to prove it. Especially with its emphasis on quantitative testing, it links up to 
common strategies of quantification as a means of establishing objectivity and 
overcoming distrust (Porter 1995). Both of these approaches are therefore con- 
cerned with different ways of establishing the scientific legitimacy of sonification. 


CONCLUSIONS 


This chapter has followed sonification through different contexts: Studying 
discursive formations and strategic demeanor within the sonification community, 
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we have scrutinized the constitution and professionalization of an academic field 
dedicated to sonification; its attempts to find applications that would attest to its 
usefulness beyond this core community in a broader scientific, artistic, commercial, 
or popular sphere; the negotiation and balancing of an identity between Science and 
art; and, finally, the disputes about notions of “quality” and how to best assess it. 
These episodes add up to a story of how an academic community that breaks with 
certain conventions of scientific data display attempts to establish the legitimacy of 
its activities. 

One of my interviewees remarked that “the niche for [sonification] doesn’t 
exist at all, not yet” (GB2, 4) Therefore, anyone doing sonification simultaneously 
has to “define a field, provide the listening education,” and understand the scien- 
tific specialization from which the data are taken (GB2, 4). In order to create a niche 
in which sonification can be regarded as an acceptable way of analyzing and repre- 
senting scientific data, much work needs to be done: transforming data into sound, 
making it sound good, developing and relaying listening skills, familiarizing oneself 
with different scientific specializations, building up a field. The creation of such a 
niche can be seen as a kind of “boundary work.” According to Thomas Gieryn, such 
“boundary work” is part and parcel of any scientific field, as Science “acquires 
its authority precisely from and through episodic negotiations of its flexible and 
contextually contingent borders and territories” (Gieryn 1995, 405). 

This chapter has looked at such negotiations of flexible and contextually con- 
tingent borders in a field that sets out to ero ss boundaries as much as it draws them; 
it creates, maintains, reinforces, adjusts, transgresses, negates, and breaks down 
boundaries, often all in one move. The extent to which sonification is positioned as 
Science or something else (e.g., music) itself becomes a part of the repertoire of 
boundary work, from which its practitioners selectively draw in order to position 
and legitimize their approach. 

Indeed, perhaps the most salient characteristic of the boundary work of sonifi- 
cation is one that has been disregarded in previous studies of boundary work: The 
practitioners not only demárcate and cross boundaries but at times, if it is conve- 
nient for them, are also quite ready to submit to existing categorizations even if they 
may not agree with them — only to distance themselves again and question those 
categorizations at the next opportunity. For example, they may accept the label of 
“music” or “sound art” in certain instances in order to acquire funding or reach a 
particular audience even if they consider this a misrepresentation of their work, 
only to then reject the categorization again and insist that sonification is not music 
after all. Existing cultural attributions are thus simultaneously embraced and sub- 
verted; sonification both assimilates to and undermines existing conventions of 
scientific representation. However, as the debates about evaluation and quality 
assessments have shown, it is by no means clear how far sonification should go in 
its strategies of assimilation or subversión. 

Sonification is thus sometimes positioned in conformity with scientific con- 
ventions such as objectivity and reliability; at other times, it is emphasized to be 
more cutting edge, avant-garde, and experimental than boring oíd disciplinary 
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Science — and often both at once. Which qualities of sonification are stressed and 
which are disregarded is very much contingent and varies from one context to 
another — the result of negotiations that both the field as a whole and the individual 
practitioners deal with. To some extent, it has to do with strategic choices related to 
research funds, audiences, and publication channels, but it is also a question of 
disciplinary identity and professional “jurisdictional control” (Abbott 1988). It has 
implications for the self-perception, the composition, the contours, and the size of 
the community: Who may speak for sonification, and who may not? If the goal is to 
establish sonification and gain acceptance, is it best to ensure its publicity and pop- 
ularity (e.g., by being very open to artistic contributions) or to appear as a small but 
highly professional community of experts? This is a tightrope that has to be walked 
on many levels; it comes into play when designing sonifications (easy to use for 
laypeople or requiring highly specialized knowledge?), when assembling project 
teams, and when preparing conferences or founding journals. There is no consen- 
sus within the field of how to tackle these questions, and, indeed, they are more 
often than not dealt with in context-specific and self-reflexive ways. 

As Olga Amsterdamska (2005, 20) has indicated, “the establishment of [bound- 
aries between disciplines] creates the need to emphasize both the distinctiveness of 
a field of research and, simultaneously, its conformity with the prevailing, though 
changeable, standards or markers of scientificity.” Such a pattern is certainly visible 
for sonification, which is positioned as both distinctive from and compatible with 
notions of scientificity. Studies of emerging disciplines have shown how these have 
struggled with defining their objects, facts, and methods and how they have differ- 
entiated themselves from or associated themselves with public health (Amsterdamska 
2005), amateur Science (Wilde 1992), or common sense (Derksen 1997); for sonifi- 
cation, a comparable reference point could be its positioning vis-á-vis not only 
music but also visualization. At the same time, what is specific for sonification is 
that it not only takes on one or a few disciplines as competitors but also challenges 
fundamental conventions about scientific analysis and data display. Since sonifica- 
tion is in principie applicable to data from any scientific discipline and its claims 
therefore potentially affect all of these fields, its boundary work, too, reaches out in 
many different directions. 

As this analysis shows, the sonification community walks a rocky path on its 
way to have listening to data accepted as a legitímate scientific activity. It takes on 
many different fields simultaneously and has to face skepticism from as many sides. 
One of my interviewees remarked that scientists from other fields do not necessarily 
dismiss the possibility of interesting sonification applications; in classic “not in my 
backyard” fashion, they just tend to rule them out for their own field. Thus, physi- 
cists might be ready to concede the potential of sonification in neurology but not in 
physics (KV, 4). However, the success of sonification will depend on its ability to 
convince specialists of the promise of sonification for their own turf. The practitio- 
ners of sonification are well aware of these difficulties — henee, the widespread con- 
cern that the field should not only develop its own facts, objects, and methods but 
also come up with killer applications. 
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NOTES 


1 See Pasveer’s (2006) discussion of X-ray technologies in relation to medical listening 
practices, Myers’s (2008) research on modeling in protein crystallography, which empha- 
sizes the importance of gestures and incorporation, or Joyce’s (2008) study of MRI, which 
concludes with a brief consideration of touch, smell, taste, and sound as sources of 
knowledge about the body. 

2 I have translated this quote and several others from Germán. A table in the appendix 
indicates in which language each interview was conducted and therefore also whether 
translation has occurred. 

3 Personal email correspondence, July 10, 2009. 

4 This was especially apparent in discussions at the 2008 workshop on “Recycling 
Auditory Displays,” organized by Frauenberger and Barrass and also mentioned in several 
interviews (MG, 13; TS, 15). 

5 He took into account all eleven papers and twelve posters that presented a design 
for an auditory application, ranging from sonifications of scientific data to auditory 
games. “Papers describing purely artistic projects” were excluded since they follow a 
different rationale, where the issue of evaluation is posed differently (Frauenberger 
2009, 59). 
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Appendix: OverView of Interviews 

Ñame Function Date and Place Language/ 

Remarks 

Conny Aerts professor of asteroseismology at KU Leuven, March 6, English 

Leuven and RU Nijmegen 2009 

Gerold Baier lecturer in systems biology, University of Bielefeld, February German/first 

Manchester 21, 2008 interview 

Vienna, November unrecorded 

2, 2008 

Stephen associate professor of digital design and Paris, March 26, English/ 

Barrass interactive media, University of Canberra 2009 unrecorded 

Eoin Brazil PhD student at the Interaction Design Limerick, February English/Skype 

Centre, University ofLimerick 29,2008 interview 

Alberto de professor at Institute for Time-Based Berlin, October 16, Germán 

Campo Media, Berlin University of the Arts 2009 

Christian PhD student in Department of Sociology, Graz, March 17, Germán 

Dayé Karl-Franzens-University Graz 2008 

Florian head of Institute for Transdisciplinarity Cologne, February Germán 

Dombois (Y), Berne University of the Arts 17, 2008 





2/0 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


Appendix Continued 

Ñame Function Date and Place 


Lars composer, flautist; responsible for artistic 

Graugaard program of ICAD 2008 conference 

Florian PhD student, Bielefeld University 

Grond (Ambient Intelligence Group); media 

artist 

Matti Gróhn ICAD board member; visualization 

specialist, Finnish IT Center for Science 

John Heise astrophysicist, Netherlands Institute for 

Space Research (SRON), Utrecht 

Thomas ICAD board member; head of the 
Hermann Ambient Intelligence Group, Cognitive 

Interaction Technology Excellence 
Cluster (CITEC), Bielefeld University 

Kristoffer associate professor in medialogy, Aalborg 

Jensen University Esbjerg; conference chair of 

ICAD 2008 

Gregory founder of ICAD; Metta Foundation, 

Kramer Portland, Oregon 

Donald professor of astrophysics, University of 

Kurtz Central Lancashire 

Tony ICAD board member; sénior lecturer 

Stockman in Computer Science, Queen Mary, 

University of London 

Katharina PhD student, Institute of Electronic 
Vogt Music and physics department in Graz 

Bruce associate professor at the School of 

Walker Psychology and School of Interactive 

Computing; director of the Georgia 
Tech Sonification Lab; ICAD president 

Olivier head of room acoustics team of IRCAM, 

Warusfel Paris 


Copenhagen, May 
22, 2009 

Paris, June 26, 2008 
Bielefeld, October 

13, 2009 

Espoo, July 1, 2009 

Utrecht, March 10, 
2009 

Bielefeld, February 
22, 2008 

Bielefeld, October 

14, 2009 
Copenhagen, 

May 22, 2009 

Germán railway 
system, July 1, 2008 

Preston, November 

6, 2009 

London, November 

7, 2008 

Graz, August 11, 
2009 

Atlanta, June 4, 

2009 

Paris, March 26, 
2009 


Languagef 

Remarks 

English 

Germán 

English/phone 

interview 

English 

Germán 


English 

English 

English/phone 

interview 

English 

Germán 

English/phone 

interview 

English 
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CHAPTER 11 


INNER 
AND OUTER 
SANCTA: EARPLUGS 
AND HOSPITALS 


HILLEL SCHWARTZ 


Introduction 


“Soon after the outbreak of war,” wrote a ghostwriter in 1916, 

a strange apparition, wretched and frightening, appeared on the avenues of 
our cities. . . . His body shook as if overeóme by a great chill, or he would stand 
stock- still in the middle of a quiet Street, in thrall of experiences at the Front. We 
see others too, men who could move ahead only with the jerkiest of steps; poor, 
palé, and gaunt, they leapt as though a merciless hand gripped them by the neck. 

The ghostwriter was Franz Kafka, proposing in the ñame of his chief at the Prague 
Workmen’s Accident Insurance Institute the creation of a A Public Psychiatric Hospital 
for German-Bohemia” (Kafka 1916/2009; translation amended). Shellshock, said phy- 
sicians, could be effectively treated only in calm residential settings, beyond the means 
of most (Lerner 2003; Schaffellner 2005). The Provincial Central Administration for 
the Welfare of Returning Veterans ought therefore to prepare vacant structures at 
Geltschbad, healthfully distant ffom the city, to receive the apparitions and give them 
the time, space, and quiet in which to return to themselves. 

Kafka did not specify the quiet. It was as implicit in Geltschbad’s clean air and 
green woods as it was explicit in millennia of Hindú, Egyptian, Greco-Román, 
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Jewish, and Islamic prescriptions for the best milieu in which to heal the feverish 
and confused. Euro-American physicians, confronting a late nineteenth-century 
epidemic of nervous collapse, had taken this a step further. Accustomed to shush- 
ing nurses and patients during intent auscultations, reliant on surgical anaesthetics 
and postsurgical narcotics to mute most screaming, they had imposed a regime of 
strict silence on “neurasthenics” undergoing rest cures at home or in sanatoria. 
Such a regime harked back to the therapeutic quietudes of Quaker-run asylums and 
to the punitive-penitent quietudes imposed on prisoners under the Silent System 
(Berlin 1912; Kirkbride 1880/1973; Rosenberg 1987). 

For patients neither nervous ñor criminal, the sonicities had been fixed by gen- 
erations of Gallican Sisters of Charity and more recently by Germán Lutheran dea- 
conesses. Under the impress of vows of obedience that made of chit-chat a seductive 
jeopardy and clangor a mortal sin, nuns and lay sisters in medieval hospices and 
early modern hospitals had tended silently to worn-out pilgrims, the local sick, the 
decaying and dying. Attendants at almshouses, leprosaria, and workhouse infirma- 
ries were held to a similar sacerdotal quiet, as would be the nurses trained by 
Florence Nightingale (1820-1910), a devout Unitarian who by the age of forty had 
become a reclusive invalid sleeping beneath prints of RaphaeFs Sistine Madonna 
and Murillo’s Virgin. Her nursing school at St. Thomas’s Hospital she regarded as 
akin to a convent, and each week she scoured the progress reports on her “proba- 
tioners,” who were assessed, among other virtues, as to their Quietness, for Cí A nurse 
who rustles . . . is the horror of a patient” (Bowers 2007; Cook 1913, vol. 1, 470-74, 
486; McDonald 2004, 490-588, 750-56; Nightingale 1859, 27; 1990, 238, 385). 

Rustling was no mere matter of inexpert bustle or crinoline fashion. Touring 
Rome’s hospitals in 1848, Nightingale had taken a side trip to Santa Maria degli 
Angelí, a church whose design, by Michelangelo, impressed her as barren yet har- 
monious enough to merit an acoustic metaphor: “That noiseless (if you may use the 
word) growth of one part out of the other, which reminds one of the growth of the 
kingdom of heaven from a grain of sand, becoming a great tree, that want of bustle 
and glaring effect and impudently forcing itself upon one’s notice . . . which is so 
like the works of God Himself’ (McDonald 2004, 217, q22i, 439, 455). Projected 
into the military hospitals of the Crimea (1853—1855), then into civilian wards of 
the Commonwealth and North America (Kalisch and Kalisch 2004), thence to 
missionary clinics in Africa, Oceania, and East Asia (Hisama 1966), Nightingale’s 
sen se of noiselessness was ecological, aristocratic, and metaphysical: a permeating 
“atmosphere” (Bóhme 2000; Connor 2006) maintained on behalf of patients by 
nurses who should have within themselves the confidence to be still when nothing 
need be done. This confidence, which deaconesses at Kaiserswerth deepened 
through their Stille halbe Stunde , a daily half-hour of meditation, carne to Nightingale 
through her wealth, her elite connections, and a “divine cali” to work with “the sick 
poor,” for whom a nurse’s presence must be as commanding as it is reassuring 
(Bancroft 1890; Cook 1913, vol. 1, 485; Gilí 2004). 

Shortages of supplies, including ether, had plagued her twenty months in the 
Crimea, so the noise of men in pain in four miles of beds sixteen inches apart should 
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have been grim, but Nightingale found that “As I went my night-round among the 
newly wounded that first night, there was not one murmur, not one groan [;] the 
strictest discipline, the most absolute silence & quiet prevailed.” She enforced a 
similar sound-discipline among the British nurses and male orderlies under her 
command. (“I don’t like the word discipline, because it makes people always 
think of drill and flogging,” she would write, “but, if they would but associate it 

with the word disciple !”) When at last she returned home after bouts 

with fever, sciatica, and earache, she was on the verge of collapse; a breakdown in 
1857 left her with chronic rheumatic pain and extreme sensitivity to sound 
(Nightingale 1990, 63-157; Goldie 1987; McDonald 2002, 468; Gilí 2004, 437-45, 
probable brucellosis). 

From her Rheocline Spring Bed, she wrote fourteen thousand letters and pub- 
lished Notes on Nursing (1859), Notes on Nursing for the Labouring Classes (1861), 
and Notes on Hospitals (1863), with sections on noise that drew as much from her 
own situation (“entirely a prisoner to my bed”) as from her time in the Crimea. 
“Unnecessary noise, or noise that creates an expectation in the mind, is that which 
hurts a patient. It is rarely the loudness of the noise, the effect upon the organ of the 
ear itself, which appears to affect the sick,” so a good nurse will act with purpose 
and “always make sure that no door or window in her patient’s room shall rattle or 
creak; that no blind or curtain shall, by any change of wind through the open 
window, be made to flap.” Those with brain injuries might be bruised by “mere 
noise,” but all patients suffered intensely from “intermittent noise, or sudden and 
sharp noise.” Worst were sounds that drew attention to themselves without arriv- 
ing at closure: whispered conversations, the affectation of walking on tiptoe, “the 
fidget of silk and of crinoline, the rattling of keys, the creaking of stays and of shoes.” 
Small unresolved noises. “Irresolution,” which Nightingale detested personally and 
politically, “is what all patients most dread,” for with it carne strain, fuss, fatigue. 
“Unnecessary noise, then, is the most cruel absence of care, which can be inflicted 
either on sick or well” (McDonald 2002, 371; Monteiro 1974, 34; Nightingale 1859, 
25-33; 1861, 35-37) (see figure 11.1). 

“One thing more: — From the flimsy manner in which most modern houses are 
built, where every step on the stairs, and along the floors, is felt all over the house; 
the higher the story, the greater the vibration.” Therefore, never install a patient 
where she will hear medicine carts rattling overhead or where he may feel 
“every step above him to cross his heart.” This was important, for rich and poor 
alike avoided hospitals, whose high mortality rates would decline only with 
the recalcitrant adoption of systematic asepsis, whose antibacterial principies 
Nightingale herself disputed. She was all for cleanliness and quiet but had no truck 
with things malignly microscopic that might seduce a nurse into neglecting the 
grosser proprieties (Nightingale 1859, 32-33; 1861, 29, 42-43). 

Victorian architects had meanwhile begun designing attic or annex sickrooms 
that afforded the better-off a quiet convalescence (Adams 1996; Bailin 1994). For 
Julia Stephen, a woman of the struggling upper-middle class who composed “Notes 
from the Sick Room” (1883) after years spent tending to a dying únele, sister, mother, 
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Figure H.i Cyril Kenneth Bird (1887-1965) created this and the posters in figures 11.2 
and 11.3 for the King Edward’s Hospital Fund for London around 1958. His first 
drawing for Punch , the English satiric magazine he later edited, was titled 
“War’s Brutalising Influence” (1916) and appeared under the pseudonymn he 
used thereafter — “Fougasse,” the French ñame for a dangerously unreliable mine. 
Reproduced by the kind permission of the family and estáte of C. K. Bird. 
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and father, sleep was the nub of healing, quiet the nub of sleep: “[T]he room should 
be so gendy hushed that the patient should feel able to drop off to sleep at any 
moment” (Stephen 1883/1987, 230). Wealthier families might benefit from ordi- 
nances that allowed their servan ts to strew straw on the Street so as to damp carriage 
noise outside households tending to the ill, but the poor sick had to abide the 
thin walls of tenements, stertorous neighbors, street-lively neighborhoods. Poor, 
middling, female, black or brown, few had the luxury, at home or in hospital, of 
a a room of one’s own,” as later advocated by Stephen’s rest-cured daughter, 
Virginia, who, like Kafka, appreciated how a veteran of the Great War might be 
wracked by the blast of a doorman’s whistle or backfire from a passing lorry 
(Kingsdale 1989; Lee 1997; Woolf 1975). 

Nervous, headachy, tubercular (he had a “cardiac neurosis”), Kafka was no 
stranger to hospitals and sanatoria, whether in his capacity as an insurance lawyer 
or in his own person, and wherever his digs, he found himself as tormented by noise 
as any apparition (Gilman 1995, 93-95; Kafka 1974). “The amount of quiet I need 
does not exist in the world,” he wrote his sister Ottla, “from which it follows that no 
one ought to need so much quiet” (Karl, 1991, 85, 394-96). In his parable of 1917, 
“Silence of the Sirens,” Odysseus at first eluded the Sirens by orneriness and 
Ohropax earplugs, bound to the mast by chains and sailing staunchly on. Homer 
had granted his hero the heroism of listening to the Sirens, whence the chains; 
during the Great War, heroic listening was no longer credible, whence the ear-peace 
of the Ohropax (Ohropax GmbH 2010). Then Kafka bethought himself of klaxons, 
modern sirens that warned of air raids or poison gas. It occurred to him that Sirens 
might be at their most irresistible when silent, seducing men from rough seas or 
deafening barrage by the promise of stillness (Kafka 1917/1971, 430-32; 1917-1919/1991, 
19-20). After the war, after years of apparitions rising out of trenches, Kafka wrote 
Der Bau , about a burrow and its burrower: “The most beautiful thing about my 
burrow is the stillness. Of course, that is deceptive. At any moment it may be 
shattered and then all will be over.” Listening for sounds of cave-in or incursión, 
the burrower hears an indefinable whistling, “audible everywhere, night and day,” 
ominous (Kafka 1923/1971, 327, 353) (see figure 11.2). 

Burrower and hero, burrow and mast: These are the figurations between which 
to hear out the history of the hospital and earplugs. Hospitals were, in this regard, 
sanctuaries from the noise of the world, their patients burrowed under blankets and 
trusting to devout nurses for protection against acoustic insult. Earplugs were sen- 
tries thrust into the meatus, inelegant guardians of an elegant labyrinth, an inner 
sanctum that was sound’s lymphatic burrow and rebirth. Hospitals and earplugs 
shared a tactile origin — a healing touch; hands clapped to terrorized ears — but the 
more heroic the medical posture of physicians and the more technical the training 
of nurses, the more the sound history of hospitals would diverge from that of 
earplugs. When hospitals were primarily places for lyings-in, amputations, and 
recuperations, or (as lazarettos) for isolation and palliation, or (as asylums) for safe 
enclosure and rest cure, they were almost as passive in their acousti-cultural archi- 
tecture as earplugs, though intended as much to contain screams from within as to 
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Figure 11.2 During World War I, Fougasse (C. K. Bird) had his back shattered by 
a shell at Gallipoli; thus, he had direct experience of hospital acoustics. Póster reproduced 

by the kind permission of the family and estáte of C. K. Bird. 
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block noise from without (Leistikow 1967; Risse 1999; Stevenson 2000). When 
hospitals touted themselves as major research laboratories jealous and zealous of 
new procedures, they became industrial cities, with the attendant environmental 
problems. In short, hospitals became noisier as physicians became more invasive in 
their diagnostics and more aggressive in their treatments, while earplugs and muffs 
became more effectively defensive with the patenting of lighter-weight, sound- 
damping materials that fit more snugly. Only during the 1990S were earplugs put on 
the offensive against noise, by which time the sonic trajectories of hospitals and 
earplugs were again running in tándem. 


Outer Sancta: The Hospital 


The original tándem had been joined by anti-noise campaigners since the 1840S, for 
anti-noisites invariably enlisted infants and invalids in their briefs against dutiful 
daughters pounding out piano scales in upstairs apartments, maids beating carpets 
on balconies, organ-grinders at the curb. All who needed sleep to thrive or survive, 
all those with finely tuned minds or artistic sensoria — in brief, all those who, like 
the English philosopher Herbert Spencer, might be drawn to the defensive use of 
earplugs or muffs — deserved help from public authorities to eliminate offending 
sounds at their source (Duncan 1908, 79, 314). So anti-noisites gathered testimony 
from hospital patients, nursing mothers, harried businessmen, and ailing great- 
aunts who attributed colic, insomnia, irritability, and general debility to pestiferous 
sound (Bijsterveld 2008, 27-90; Schwartz 2011). Noisites, who professed to delight 
in the sonic robustness of civilization on the make, could scarcely attribute to 
infants and young mothers that “morbid sensitivity” to sound through which 
they — and the Bench — often dismissed noise complaints. 

Anti-noisites also bemoaned the indefensibility (or “inadjustability”) of the 
ear, which had no lid or lips and did not sleep (Hazlitt 1930; Schwartz 2003). Since 
plugging the ear with a finger, stuffing it with cotton wool, or enveloping it with 
muffs minimally buffered oncoming noises, and since sound was conducted 
through the bones of the skull as well as through the ear canals, there was only so 
much the sound-sensitive could do before leaving the field to the imperium of 
noisemakers. 

Imperium: the command of space and time, which had been Nightingale’s 
hardest-fought battle as she sought for her head nurses a paramountcy in the wards 
second only to physicians. Toward this end she promoted a pavilion style of hospi- 
tal architecture that gave each head nurse her own demesne and the closest scrutiny 
of patients (Nightingale 1863). European hospitals had previously followed the 
model of army barracks, monastic hospices, and church naves, with beds arrayed in 
long columns, one or more aisles between, and with provisions for natural light, 
access to “necessaries,” and ventilation adequate to feed the lungs while dispersing 
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odors and diffusing those deadly miasmas that gave Nightingale the willies. Medieval 
and Renaissance architects, working for pious hospital patrons, had also been 
directed to sitúate a chapel so centrally that patients could see or at least hear the 
recital of the Mass, its ritual thaumaturgy no less vital than physick. Privacy, when 
and where accommodated, was ensured by curtains around bedsteads or, in cases 
of high privilege, by sepárate cells, but the new demand for privacy that aróse among 
the middle classes in the mid-eighteenth century was not yet so strong that 
Nightingale thought it determinative when she laid out the wards of public hospi- 
tals in the mid-nineteenth (Abel-Smith 1964; Taylor 1991). The pavilion model, 
which she rather adapted than invented (Risse 1999), did presume an intímate 
supervisión over fewer beds, but her emphasis on surveillance required an open 
floor plan, which was all the more reason to incúlcate habits of quiet among 
nurses and orderlies (Billings 1895) — and to establish, as planned in 1863 for the 
“Nightingale” State Emigrant Hospital of Manhattan, a sepárate nurses’ room and 
staff quarters, “thereby equalizing the temperature of the wards, and shutting off 
the noise of the halls, staircases and sanatoriums; affording likewise a modérate 
degree of conversation and amusement among the convalescents, without disturb- 
ing the weak, helpless, and suffering patients, who are unable to leave their beds” 
(Kennion 1868, pt. 3, 77). Where patients themselves were unruly, shrieking in pain, 
or unavoidably loud (with whooping cough, for example), she would have them 
assigned to their own open, mutually noisy wards. Prívate rooms she distrusted 
as warrens in which dirt could hide, miasma regroup, defiant patients conspire 
(McDonald 2002, 458, 468). 

No more than three stories high (to ensure ventilation), pavilion hospitals 
expanded outward like army encampments. This could rarely be achieved in city 
centers, which was fine with Nightingale, who had grown up in a country manor 
and believed that hospitals belonged where air was sweeter and the quiet timeless. 
For fear of infecting or offending townspeople, medieval hospitals with their mor- 
tuaries had been built outside town walls; modern hospitals, divorced from morgues, 
were to be situated in the countryside for fear of cities disturbing their patients — a 
reversal neater in theory than in practice. Most new hospitals after 1850 aróse where 
the need was greatest and patients nearest, in metrópoli whose density constrained 
them upward (Abel-Smith 1964; Risse 1999). The new verticality was enabled by the 
advent of steel-frame construction, “safety” elevators, electric lighting, forced-air 
furnaces, and huge rooftop fans, whose ratcheting, flickering, thundering, screech- 
ing, and whoosh paradoxically added to the noise of cities, through whose avenues 
roared hospital ambulances with dúplex sirens (Hornsby and Schmidt 1913). 

Ambulances and their hospitals proliferated during the Belle Époque in 
response to an increasingly urban world beset by the derangements of density: road 
and rail accidents; skin, ear, and lung diseases from bad air; infections spread 
through the overwhelmed plumbing of tenements. As hospitals multiplied, they 
were parsed into medical specialties: contagious diseases, geriatrics, maternity, pedi- 
atrics, psychiatry, syphilis, tuberculosis. Anti-noisites exploited them all as wedges 
toward the passage of zoning laws that would keep industrial and commercial 
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activity remóte from the residences of good people. In 1905, New York did boast a 
section of city code that forbade peddlers from bawling their wares within 250 feet 
of a church during Services, a court in session, and a hospital at all times, but this 
went largely unenforced, like other noise laws, for want of patrolmen and standard 
audiometrics. Lawyers and architects had had some luck forcing the relocation of 
hospital operating rooms from streetside rooms to walled-off interiors where 
screams could not be heard; hospital administrators had had no such luck in damp- 
ing the clamor of nearby traffic (Gutton 2000, 88-90; Kestner v. Homeopathic 
Medical and Surgical Hospital , 245 Penn 326 [1914]). The eventual designation of 
hospital a quiet zones,” with bold signage and stiff fines, would lay the groundwork 
for a policy of civic sonic set-aparts in other public contexts. 

“Zones” had been on the lips of astronomers for a century, of geologists for 
eighty years, of railmen and diplomats for twenty, of timekeepers and trolley con- 
ductors for ten. While towns since Jericho had relegated smelly and noisy occupa- 
tions to precincts just within or beyond their walls, and while Frankfurt and Berlin 
in the 1890S had passed ordinances specifying the functions of buildings in each 
ring of development around the city hub, American urban planners had still to 
defend the legality and morality of explicit zoning. Everywhere, however, cities were 
ipso jacto zoned by the nature of the environing sounds, according to which dis- 
tricts were paved and with what — asphalt, brick, wood blocks, or cobblestone. 
Hospital quiet zones were in this sense a sound-remarking (Bijsterveld 2008, 243-53; 
Peterson 2003, 308-17; Rodgers 1998, chs. 4-5; Sandweiss 1997). 

Julia Barnett Rice was credited across the Atlantic ecumene for making the 
re marking culturally persuasive. A gradúate of the Women’s Medical College of the 
New York Infirmary, she never practiced medicine. Instead, upon marrying the 
pianist, lawyer, and venture capitalist Isaac Rice, her career was more broadly that 
of “helping other people” and publishing The Forum: A Magazine ofPolitics, Finance , 
Drama , and Literature , a leading Progressive journal. Her run-in with noise, aside 
from raising six rambunctious children, carne after she and Isaac in 1902 built a 
mansión overlooking the Hudson. If Manhattan’s Street traffic was loud enough to 
bump wealthy families off central arteries toward the banks of the Hudson or the 
East River, river traffic had its own clamjamfry with “the shrieks of passing steam- 
ers, the discordant notes of harbor craft, the puffing and wheezing of tugs, the din 
of escaping steam, clanging bells, howling men.” (Van Dyke 1909, 32). Back from a 
tour of Europe in August 1905, Julia and her sleep-deprived children were assailed 
by ever-more strident whistles, steam sirens, and compressed-air horns from a flo- 
tilla of tugs. Not one to seethe in silence, she resolved to elimínate the noise, though 
cognizant that a campaign for quiet issuing from the mistress of Villa Julia would 
be subject to innuendos from the Left of ethnic prejudice or bourgeois maternalism 
and from the Right of hysteria or economic naíveté, objections voiced during prior 
anti-noise campaigns in London, Boston, and Chicago. She therefore deflected the 
debate from her person (Southern, Jewish) and position: not the wealthy matron at 
her high window but the indigent and infirm (immigrant, Black, Irish, Italian, 
Polish) in hospitals along the Hudson, and hundreds of sick poor children (ditto), 
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none of whom could sleep while tugs blatted at two a.m. for crews in saloons (Gatlin 
1913; Heinl 1908; Lomax 1996; New York Daily Tribune y November 29, 1905; New 
York Times , December 10, 1905; New York Times Magazine , January 14, 1906; Rice 
1906 [both]; Seidler 1989; Sloane 2008). 

Julia visited the Gouverneur and Bellevue hospitals on the East River, the 
Children’s Hospital on Randall’s Island, and the Metropolitan Hospital on 
BlackwelLs Island, all of which lay near tugboat moorings. She called on the Board 
of Health, Pólice Department, and dock commissioner, armed with testimoniáis 
from superintendents of hospitals “representing the sufferings of 13,000 poor 
people” and letters from neurologists about the danger of shrieking whistles — at 
Bellevue, patients mistook the whistling for their death knells (Morse 1906; Rice 
1906 [both] ). The city’s chief medical officer, Hermann M. Biggs, who had expanded 
its program of school medical inspections to test for hearing loss, was “thoroughly 
in sympathy” with Mrs. Rice. Less reclusive than Nightingale and more adept with 
reporters, Julia made sure that Biggs’s own specialty, medical pathology, would be 
in play: “The bacilli which haunt the air are not worse enemies than the sound 
waves that quiver in the air and beat in upon our brains,” wrote the editors at the 
New York Daily Tribune (November 26, 1906) after listening to Julia. “There are 
benign bacilli, but there are no benign noises.” All Biggs could do was to ask Corpo- 
ration counsel about his jurisdiction, if any, over noise arising on the river. Counsel 
dithered. 

An impasse, was this, or a signal moment? It is common nowadays for noise to 
be characterized as “liminal,” as sounds that teeter on the edge of meaning, fitful of 
place and fretful of time. Liminality was not what Julia Rice had in mind when she 
campaigned against noise, ñor was it what decades of New Yorkers understood 
when protesting doormen’s whistles, itinerant brass bands, or church bells. They 
knew what the noises meant; they knew well where/ who/ what the noises carne from; 
they knew very well how much earlier than dawn or later than dusk the noises 
aróse. Indeed, knowing the time, place, and meaning of the noise was crucial, for 
only then could they argüe, as they did, that the sounds carne at the wrong time, in 
the wrong place, in excess, or as atavisms. Not liminal: obtrusive and inexcusable. 
A campaign against frustratingly unassailable whistles or sirens had to be an asser- 
tion that no engineered sound should escape human command. If tug whistles 
resembled “the wails of lost souls in torment, or children in terror, or wounded 
horses, or distant Scottish bagpipes, or the Banshee,” one ought not grant them a 
legal limbo. True, the intensity of aural sensations had then no universal metric. 
Still, attention had to be paid and arguments made if banshees were not to haunt 
the recuperative sleep of the thousands in hospitals ventilated by invigorating river- 
side air (Lomax 1996; New York Evening Sun , November 27, 1907). 

When propagated on interstate waters, noise turned out to be a federal issue 
requiring a revisión of the U.S. Code to give municipal officials the authority to 
squelch local whistling — a revisión that Congress, at Julia’s instigation, did pass and 
Theodore Roosevelt signed into law. Julia meanwhile had been contemplating her 
next step: the formation of a Society for the Suppression of Unnecessary Noise 
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(SSUN), which would go after all sorts of noises and appeal to the public to relieve 
“the intense suffering of our sick poor from the noise-evil.” This would entail a 
push for the creation of sound-protected areas, or quiet zones, around hospitals 
(New York Daily Tribune , December 4, 1906; New York Times , June 2 6, 1907; Rice 
1906 [both]). 

June of 1907 saw the first official signs (“NOTICE— HOSPITAL STREET”) 
posted around each New York hospital. From Manhattan such signage spread to 
Chicago, London, París, Hannover, Amsterdam, and Milán. Patrolmen on hospital 
beats were to insist that horses be walked, newsboys rerouted, motorists fined for 
honking. Realizing that there could never be enough officers to make the signs other 
than cautionary, Julia got twenty thousand schoolchildren to compose pledges: 
no pounding of hospital fences with sticks, no roller-skating on hospital sidewalks. 
“I promise not to play near or around any hospital,” wrote a child eager for 
membership in the Children’s Hospital Branch of the SSUN and a smile from 
Mark Twain, figurehead of the campaign. ct When I do pass I will keep my mouth 
shut tight , because there are many invalids there” (New York Times , June 26, 1907; 
February 27 and April 6, 1908; Rice 1908 [both]; Society for the Suppression of 
Unnecessary Noise 1908, 3.) 


The Noise from Within 


Alas, quiet zones around hospitals could not suppress the new noises arising 
from within. Self-noise, an issue of moment in twentieth-century physics and 
experimental psychology, became an equally serious issue for hospitals as they 
modernized. The greater the efforts of administrators to implement antisepsis with 
sterilizable metal or Pyrex apparatus, easily mopped floors, and impermeable coun- 
ters, the louder and more reverberant became their hospitals. The more strenuous 
their effort to centralize Systems of accounting, distribution, and communication, 
the noisier became their hospitals with devices of acoustic repercussion: the pound- 
ing of keypunch machines; the clattering of metal carts; the clunk of dumbwaiters; 
microphones and wall speakers fraught with feedback and interference (Govan, 
Anderson, and Reilly 1934). And the taller the hospital buildings, climbing upward 
to serve populations encouraged by modern medical “miracles” to enter diagnostic 
wards and maternity wings, the lower the ceilings and more blatant the HVAC. 
From the outside, hospitals could seem massively quiet; within, they were louder 
and shriller — or would seem so to the acutely ill and injured, who were becoming 
the primary clientele (Adams 2008; Hornsby and Schmidt 1913, 109) (see figure 11.3). 

Shunted to nursing homes or back to families with cupboards of off-the-shelf 
analgesics, the chronically ill and/or debilitated would no longer occupy the major- 
ity of beds; instead, there was a more diverse, demanding hospital population, 
anxious for action and addicted to telephones and radios. Contrary to Nightingale’s 
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directives but at the insistence of well-heeled patients and donors, prosperous hos- 
pitals incorporated more semiprivate and prívate rooms. The better-off would no 
longer have to endure the rude hawking, coughing, snuffling, snoring, and shriek- 
ing common to open wards or tolérate noises they associated with lower classes, 
immigrants, and other races. Since the newly enclosed rooms were out of nurses’ 
audiovisual range, hospitals had then to install bedside cali buttons, monitors, and 
alarms, compounding the cacophony at and around nurses’ stations (Coser 1963, 
248; Goldwater 1932; Gorrell 1936; Griffin 1950; Underwriters Laboratories 1979). 

Toileting, too, became more prívate, and no one had a cost-effective solution 
for the abominable acoustics of flushing, at the bowl or throughout the innards of 
a building, even where (as in England) drainpipes were bolted to outside brick. To 
isolate throbbing conduits from contact with other internal ducts was forbiddingly 
expensive, and industrial-strength hospital toilets were far louder than those at 
home. For bedridden patients, the clangor of new stainless-steel bedpans — slick, 
oblong, hard to stack, and often dropped — was more discombobulating than the 
shatter of earlier crockery. 

It was a toss-up, depending on country, culture, and economic circumstances, 
as to whether rough wooden floors would be replaced by linoleum or terrazzo; 
creaking wooden bedsteads by adjustable metal; slamming shutters by flapping 
blinds; gas jets by incandescent or fluorescent lights; wooden bowls by Bakelite or 
tinplate; dust mops by vacuum cleaners. If they chose linoleum and Bakelite, hos- 
pitals could claim small victories with regard to noise control; terrazzo and tinplate 
meant pandemónium (Grumet 1993) (see figure 11.4). 

At their most technically sophisticated, hospitals were intrinsically noisier: 
With each new device for diagnosis, monitoring, or treatment, the peskier would be 
the background hum and the louder or beastlier the foreground of (sometimes lit- 
erally) bells and whistles. If physicians frowned upon faradic current generators 
and other Victorian electrotherapeutica, modern hospitals would proudly resound 
with X-ray machines, electric floor polishers, intercom paging, air conditioners, 
and mobile trays of metal and glass (Agnew 1974; Howell 1995; Kingsdale 1989; 
Rosenberg 1987; Stevens 1918; Kersbergen 1931 for images). In sum, the machines 
conduced to exactly that bustle deplored by Florence Nightingale, Julia Barnett 
Rice, Julia Stephen, and Virginia Woolf. 

Should one put a gendered finger on the anti-noisite pulse as felt in the hospi- 
tal, one might suppose that educated women were extending an age-old guardian- 
ship of household peace into institutional sick rooms. Or that, as domestic 
economists and “municipal housekeepers,” modern women were seeking to effect 
a quietness that was elsewhere a sigil of efficiency. Or that the “civilizing mission” 
of women called for a quiet as restorative as that reconstructive surgery refined 
during the Great War, the ears tucked back and in. But insofar as anti-noisite 
impulses were gendered female — and many a male sanitary engineer, architect, and 
physician of the twenties and thirties was equally unhappy with hospital noise 
(Censorinus 1938; Dixon 1927; Horder 1938; League of Nations 1937; Neergaard 1929, 
1932; Stevenson 1936; Tanon 1938; Thompson 2002; Tucker 1929) — it may be more 
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valuable to consider quietude as invoking two experiences rare in most women’s 
daily lives: uninterrupted time and insulated spaces. For women, more than men, 
the qualia of quiet resonated with the promise of self-healing and a room of one’s 
own, which might be possible only in the hospital (Hofrichter 2006; Ibels 1916). 

After World War II, campaigns for quiet within hospitals accelerated even as 
truck and motorcycle traffic exploded through quiet-zoned avenues. With ears sen- 
sitized by the battlefield experiences of physicians, the combat stress of veterans, 
and the psychiatric training of war nurses carne not only more powerful (electric/ 
electronic) hearing aids but also subtler measurement of the effects of acoustic 
aggravation on blood pressure, blood chemistry, adrenal excretion, and coronary 
and cortical activity (Ades 1953). 

Armed with antibiotics that allowed for more intricate operations and better 
management of infection, surgeons and internists repositioned hospitals as spaces 
of urgency and potency whose quiet dramas attracted donors for new wings that 
housed noisier if otherwise more advanced technologies (Association des Hautes 
Études Hospitaliéres 1968; Falk and Woods 1973; Fife and Rappaport 1976; Gendre 
1976; Golub 1969; Haslam 1970; Hinks 1974; King Edward’s Hospital Fund 1958; U.S. 
Public Health Service 1963). 

Of particular urgency and potency was obstetrics, which since the 1890S had 
been improving its skills at maintaining infants born prematurely. Where each 
Intensive Care Unit became a hospital within a hospital — Nightingale’s legacy of 
I-See-You surveillance — each newborn intensive care unit (NICU) was that much 
more metonymic: The more transparently a preterm infant could be kept alive and 
thriving, the more appealing would be the hospital itself as an artificial womb 
(Russell 1977). Once ultrasound monitors became common and prospective par- 
ents could see that fetuses at six months were acoustically alive to the aerien world, 
investigators began to attend to the sense-surround of infant incubators and dis- 
covered that these were among the loudest of hospital environments, louder even 
than the bellows of iron lungs for those stricken with polio (Baker 1996; Vaizey 
1986). The premature fetus and the neonate, like other patients, must suffer the 
indignities of a raucousness produced by the very devices that kept them safe — 
fiercely ventilating pumps, beeping monitors, the flap-slap of hinged glass (Falk 
1974; Frank 1991; Kahn et al. 1998; Michaélsson, Riesenfeld, and Sagrén 1992; Seleny 
and Streczyn 1969). 


Inner Sancta: The Earplug 


We are back to the sonic insecurities of the burrow, which were, inside-out, the 
sonic insecurities of earplugs. Like ear trumpets and other obstreperous hearing 
aids, earplugs invoked contradictory stigmata: Either the users were prematurely 
oíd and cranky or parading their hyperacuity in hopes of special dispensation; either 
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Yelled the Coionel 


at the Prettiest Nurse 
\ on the Staff 


—a bunch of idiota who spend half their time P ltals 18 onl y one P hase of our aetivity. 
beating on the pana wíth spoons? — And ií 1 For adviee and Information on the solution 

ever catch the fellow who rings the bella on of any problem regarding unwanted sound 
that elevator I’ll blast him with an elephant or acoustical correction— in any building of 
gun!~To say nothing of that enunciator that any typeorsize— consultyour Celotex Sound- 
contimially squawks» Dr. Parsom—Surgery Conditioning representativa. Heiaamember 
— Dr. Pamons—Surgery! Why, the Anl Tor- of the most widely experienced acoustical 

ture of the Congo tribes doesn’t organization in the world, 

cotnpare $vith the noise around here!’ 

“Of course, all I could do at the 
time was try to soothe our distin- 
guished patient and caution ever y- 
one to be as quiet as possible. But at 
the next meeting of the Hospital 


Fttf OfFERI The itlustrated magazine, “Quiet 
Forum tvhick rrcounts the eme histories of 
many sound-conditioning installations , mil 
besenttoyoufreeonrequest. Learn hmo modern 
sound-conditioning is contributing to profit 
and comfort all <mr America. Address; The 
Celóte x Corporation , Chicago, Illinois. 


‘/^OLONELMARTiNAhefamousbiggame Board I got some real adion a~bout Sound- 
AJ hunter, was given just about the Conditioning Our corridors, 
most convenienl room in the hospital. He was “A few visita to other hospitals in towfi hnd 

near the diet kitchen— »so he got his fctay ^ ^someconversationwiththeirSuperintendents 
first. He was neár' the elevator beesuse he prompted us to calí in the Celotex Sound- 
had a lot of visitora. The room was bright, Conditioningpeople.Theyanalyzedourprob- 
and we thovght he was happy, lem quickly and did a fine job. Incídentally, 

“Then one day Miss Page, the prettiest Nurse Page is still here, but the Coionel and 
nurse on the staff, carne into my office in the disturbíng noise have both Scrammed 
team, Coionel Martin, it aeems, was jn. a — S s q 

roaring rage. And when I went to investígate Celotex Sound-Conditioning serves soci- 
he started on me! . ety, business and industry in many ways. 

“ ‘Who works inthat kitchen acrossthehaíl Quieting noisy corridors and rooms in hos- 


Figure 11.4 The coionel in this 1942 advertisement likely had a prívate room 
and high expectations of hospital quiet, which could be met, suggested the 
Celotex Corporation, through the installation of its sound-absorptive ceiling tiles. 
Reproduced by permission of the CertainTeed Corporation. 
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the users were misanthropic, holding people at a distance, or malingerers, conning 
them closer. Apart from such stigmata, there were practical dissuasions against the 
use of earplugs even in the noisiest of contexts, on battlefields, in boiler faetones, at 
the power looms of textile milis (e.g., Holt 1882; Rodger 1915). Soldiers, sailors, 
engine mechanics, and artisans claimed for safety’s sake, as well as for quality con- 
trol, that, like musicians, they needed to hear the subtlest of sounds (Bijsterveld 
2008; Boys 1915; Guild 1951). Further, it was a point of pride to men who labored 
among drop forges, printing presses, or metal lathes that their ears grew “stronger” 
over the years, and at home they seemed none the worse for wear. Besides, plugs 
themselves swiftly became the worse for wear: Gummy with wax or body oils, they 
attracted dust, dirt, and half-dead bugs that could infect the ear canal, producing 
the smelly effusions of otitis media, which, untreated, could cause hearing loss and, 
backing up into the brain (said some), meningitis or brain rot. Then there was the 
discomfort of plugs too tight, the uselessness of plugs too loose, their ineffective 
shielding from low-frequency, bone-shaking rumbles. Finally, although plugs had 
long been recommended to prevent coid salt water from getting into the ear during 
ocean swimming, there was little evidence that the use of earplugs in the long run 
prevented hearing loss since it appeared that, regardless of profession or circum- 
stance, men and women (city men and women, at least) lost acuity as they aged 
(Bauer and Edgley 1939; Behar, Chasin, and Cheesman 2000, 26-30; Knudsen 1939; 
Knudsen, Gales, and Watson 1941; Warfield 1948). 

Related to this finality was another, more covert reluctance: partial deafness. 
A majority of modern populations — industrial laborers, farmers driving tractors 
and combines, housewives in homes aswirl with flakes of lead paint, children who 
survived scarlet fever or suffered from rickets — already had a degree of deafness 
that would seem exaggerated by earplugs. That is, persons ignorant of (or undis- 
turbed by) their hearing problems might be more deafened — or troubled — by the 
most innocuous of plugs or made more aware of the distressing tinnitus that occurs 
in half of all cases of noise-induced hearing loss (Schwartz 2011). 

Inserting plugs was eargo a risky business, sociologically, otologically, psycho- 
logically. Where the loudness or shrillness of recurrent sounds was demonstrably 
deafening and disabling, with episodes of disorientation, loss of balance, and/or 
ringing in the ears — as with naval gunners, tank captains, artillerymen — then the 
use of “ear defenders” was less fraught, and orders were issued. During the Great 
War, when ear defenders were thought capable, perhaps, of reducing the incidence 
of shellshock or at least of tempering its sound-cued aftermaths of jerking and 
nightmare, plugs were sometimes requisite and frequently requisitioned, but most 
soldiers in the trenches did not cotton to the idea of putting mud-sopped plugs into 
ears rocked by hours of shelling that reverberated through their new metal helmets. 
Aside from gunners, those who did use plugs most regularly were pilots, who were 
expected nonetheless to return from the war with permanent hearing loss (Grant 
1917; Fíorne 1914; Jaehne 1911; Trible and Watkins 1919). 

How much hearing was lost due to the inconsistent or imperfect use of ear 
defenders during the Great War we will never know, as hearing tests at induction 
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were perfunctory and at discharge were done only if pensión claims were filed. 
Statistics in any case rarely convinced people to use earplugs even aíter electrical 
audiometers carne into general use during the 1930S. By then, other technocultural 
habits were making people comfortable with having something about their ears. 
First, headsets, as worn by switchboard operators, by navy men listening for 
submarines, by secretarles at dictaphone desks, by home audiences sharing lines 
to early radios (Sterne 2003), and by schoolchildren listening for audible tones 
produced by audiometers. Next, telephone receivers themselves, to which one cud- 
dled one’s ear not only to hear more clearly but to block noises from outside. 
Next, miniaturized hearing aids made to fit almost invisibly within the meatus. 
Finally, commercial airplane travel in cabins so noisy that earplugs were de rigueur 
(Arnhym 1944). 

During the Second World War, the most mechanized and industrial of wars, 
more men and women had good experiences with headsets and acquired also a 
healthy regard for ear defenders (Littler 1940; Knudsen 1939). Afterward, they had 
the benefit of a new war material, synthetic foam, that improved the fit and comfort 
of plugs and of headphones for portable radios. The strongest incentive toward the 
use of earplugs, however, carne from having to tend to thousands of enlisted men 
who had returned from the war hard of hearing and had to be “reconditioned” or 
recompensed. Not that any statistician could prove that the use of earplugs would 
have prevented most cases of partial deafness but that otologists now had firmer 
evidence of noise-induced hearing loss due to long exposure to industrial-strength 
sounds and short but repeated exposure to high-frequency or percussive sounds 
(Luz, Decatur, and Thompson 1973; Prasher 1998). Although industrial trade asso- 
ciations and some researchers were still vouching for the marvelously recuperative 
powers of ears (Snow and Neff 1943), the military and the higher courts were increas- 
ingly inclined to acknowledge that hearing loss was not exclusively a matter of ageing, 
blunt-force trauma, or ototoxicity. Partial and compensable deafness could result 
from specifiable lengths of exposure to specifiable decibel mínimums at specifiable 
frequencies and could be forestalled with noise-control programs that involved the 
erection of sound-deadening panels on the shop floor and requiring workers to 
wear earplugs (American Academy of Ophthalmology and Otolaryngology 1973; 
American Industrial Hygiene Association 1973; Associated Industries of New York 
State 1956; Employers Insurance of Wausau 1957; Robinson 1971). 

Unions encouraged workers to wear the plugs so as to demónstrate that certain 
work environments were unfriendly to ears and to lay the basis for future compen- 
sation claims for cumulative hearing loss on the job. To the extent that workers 
during the fifties and sixties went along with blue-collar unions on this, earplugs 
were rather a pawn in bargaining over factory conditions and workers’ compensa- 
tion than a means of avoiding deafness (Frisch 1993, 115-17, 132-33; Mine Safety 
Appliances Company 1954; Braun, this volume, who also furnishes data on the 
effectiveness of plugs). 

Though foam-fitted and less easily infected, earplugs still did no more than 
they had done for millennia, impassively blocking the passage of a range of 
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vibrations through the ear canal. A more active approach, imagined for centuries, 
began to appear feasible during World War II as physicists and psychoacousticians 
studied masking techniques and noise cancellation in connection with the jamming 
of enemy radio, radar, and sonar. Masking could be sheerly electrical, smothering 
sounds under the white noise inherent in electrical devices and as easily amplified 
as in cheap battery-powered hearing aids, where static was rather a problem than a 
solution. Noise cancellation required sophisticated electronics in order to accom- 
plish a series of instantaneous analyses of incoming sounds such that exactly inverse 
waves of different amplitudes could be produced to negate each successive offend- 
ing vibration. Such “active noise control,” first implemented for the quieting of 
commercial jet engines, could not be effectively applied to so small a feature as an 
ear until microprocessors had reached a third generation of development. Even 
then, noise cancellation was best achieved with a headset or pair of muffs about the 
size of those cosmetic powder puffs sewn into pilots’ earflaps during the twenties on 
the advice of panjandrums of aviation medicine (Bauer 1926, 157). 


Conclusión: Burrow and Mast 


Active noise control made of the earplug something of an antenna — or a Homeric 
mast — for sensing and silencing sounds that threatened the integrity of the cochlea, 
while the well-insulated, musically accomplished headset protected the absorption 
of the listener. No more a burrower, the noise-canceling plug or muff stood like a 
guardián automaton at the entrance to a classical labyrinth, programmed to repel 
all intruders. If this was rather more wily than heroic, so was the program of care in 
late twentieth-century hospitals, where noise cancellation was technically still a 
Nightingale dream (Busch-Vishniac et al. 2005; McCarthy, Ouimet, and Daum 
1991) and nurses had begun recommending earplugs to their patients (Haddock 
1994). Active noise control meant rather a repudiation of hospitals as burrows and 
a vigorous effort to discharge as soon as possible the most obstinate of noisemak- 
ers — the loudly demented, the chronic coughers, the second opinionators (Mizrahi 
1986; Stevens 1989; Strumpf and Tomes 1993). The motive for rushing patients out 
the door was to some extent the rediscovery that hospitals themselves — due to 
incestuous HVAC systems and a democratic openness to visitors — were significant 
sources of secondary infection. To a greater extent, rapid discharge was motivated 
by an ideology of urgency. Except for lucrative cosmetic or “optional” procedures, 
hospitalization now implied swift intervention involving invasive tests and sound- 
ings, complex arrays of drugs, and/or surgery, an expertise in which hospitals prided 
themselves and, in some countries, profited highly therefrom (Betsky 2006; Miller 
and Swensson 1995; Rosenfield 1969). 

Convalescence from such traumatic doctoring could be relegated to auxiliary 
institutions where the need for quiet was not played out against a calculus of 
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mortal peril. So hospitals, too, began to be re-presented as antennae and masts. 
No longer refuges for the weary, decaying, or dying, they were “centers” for exqui- 
site detection and Oddyssean mainstays for hairsbreadth escapes (Knipping and 
Kenter 1967, 14-38; Verderber and Fine 2000). 

As embodiments of two distinct modes of dealing with noise, the individual 
and the institutional, earplugs and hospitals meet up time and again at precisely 
that intersection at which Kafka put his hapless heroes, subject to the perpetual 
buzzing of The Castle and an ominous, indefinable whistling. 
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CHAPTER 12 


SOUNDING 
BODIES: MEDICAL 
STUDENTS AND 
THE ACQUISITION 
OF STETHOSCOPIC 
PERSPECTIVES 


TOM RICE 


Introduction 


In 1999 I conducted a short study of patients’ experiences of the sound environ- 
ment at the Edinburgh Royal Infirmary (Rice 2003). For many patients the sounds 
of the ward had become symbolic of the tedious routine of hospital life. The caco- 
phony of voices, footsteps, coughing, cries of pain, rattling trolleys, bleeps from 
medical equipment, and so on served to reinforce and exacérbate feelings of pow- 
erlessness among many patients. Nurses to whom I spoke tended to have a different 
perspective. They generally interpreted the noise of activity on the ward as a posi- 
tive sign, indicative of order and the smooth running of care. They were more alert 
to signáis given by cali buttons and to potential emergencies indicated by particular 
tones from medical equipment and unusual sounds or quietness from the beds. 
Spending time on the wards, I also observed doctors going about their work. I was 
interested to see them listening to patients’ bodies using stethoscopes. The instruments 
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enabled the doctors to shut out the ambient noise of the ward and to immerse 
themselves instead in the acoustic space created by the patient’s body. The hospital, 
it seemed, was a space in which different forms of listening and layers of acoustic 
knowledge were constantly in play (see Schwartz, this volume). 

In order to broaden and deepen my understanding of how sound is implicated 
in hospital life I carried out a year of ethnographic fieldwork between 2003 and 
2004 based at St. Thomas 5 Hospital in London. I was particularly eager to explore 
stethoscopic listening during this period. Aware of the importance of auscultation 
in assessing the health of the heart, I approached consultant cardiologist Dr. John 
Coltart, then head of cardiothoracic Services. He told me that he considered his ears 
to be one of his most important clinical tools and agreed to act as my supervisor. He 
invited me to particípate in the classes he taught, in which medical students learned 
to listen. 

This chapter gives a brief survey of a number of techniques used to appropriate 
body sounds as a diagnostic resource in the Western medical tradition. The work of 
Reiser (1978), Lachmund (1998, 1999) and Sterne (2003) indicates that auscultation 
was instrumental in bringing about important changes in the way in which doctors 
engaged with patients in Western medicine. It allowed them to begin to make diag- 
noses without having to rely on patients 5 illness narratives, creating scope for the 
formation of independent clinical judgments and permitting listeners a degree of 
physical and perceptual distance and detachment. It also required specialist knowl- 
edge, the possession of which granted doctors new social status and distinction. The 
stethoscope's application in examinations of the heart and lungs is now frequent 
and routine in many modern clinical settings, and the instrument has become the 
archetypal Symbol of the Western doctor. 

Though a well-established technique, auscultation has by no means been a 
universal or homogenous practice. Lachmund (1999) explores two distinct (though 
clearly related) approaches to auscultation and the interpretation of sounds as 
diagnostic signs that emerged in París and Vienna during the nineteenth 
century. In doing so he draws attention to the need to understand the practice 
of auscultation (and forms of scientific work more generally) as historically 
located, embedded in contextually specific networks of social and spatial relations. 
With Lachmund in mind, this chapter explores some of the techniques through 
which students are taught to focus their hearing and analyze sounds received 
through the stethoscope. It positions auscultation as a way of knowing through 
sound that is in the process of transmission between generations of medical 
practitioners. 

Sterne, Reiser, and Lachmund document early problems identified with aus- 
cultation. They describe difficulties encountered in creating a robust taxonomy 
for body sounds and in codifying sounds in ways that guaranteed consistent and 
reproducible interpretations between listeners. While the difficulty of establishing 
consensus over the character and significance of subtle heart sounds remained a 
subject of discussion among doctors at the cardiothoracic unit at St. Thomas 5 , aus- 
cultation at the time of my research also formed part of a repertoire of increasingly 
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sophisticated diagnostic techniques. The work for which the stethoscope was used 
was considered by some to be better performed using other kinds of equipment that 
drew on alternative skill sets (cardiac ultrasound scanning, for example). This chap- 
ter, then, locates auscultation not only in relation to discussions of the perceptual 
and linguistic problems which the practice generates, but by reference to debates 
over the valué and relevance of auscultation in the twenty-first century. In doing so, 
it echoes Lachmund’s cali for the particularities of stethoscopic listening to be 
understood within their specific social and historical context. 

The students with whom I worked generally approached auscultation with 
enthusiasm. However, the technique also represented a new and sometimes discon- 
certing way of engaging with the body. In becoming attuned to heart sounds, stu- 
dents were required to encounter the bodily interior in a novel acoustic light. Their 
reactions to being asked to apply a newly acquired auditory “gaze” not only to hos- 
pital patients but also to friends and family members highlighted the intriguing and 
sometimes disturbing experiential subtleties and complexities of an apparently 
simple and straightforward diagnostic technique. Outside the clinical setting, their 
new knowledge of sounds as diagnostic signs could have a destabilizing effect by 
repositioning family members (and even the students themselves) as diagnostic 
objects. Clinical knowledge did not always remain within fixed or tightly delineated 
communities of practitioners but, so to speak, bled out into interactions beyond the 
hospital. The chapter therefore points to the conceptual and imaginative, as well as 
the diagnostic power of the stethoscope. 


A History of Listening 


During one of the first classes I attended during fieldwork, Dr. Coltart asked his 
students, “Who invented the stethoscope?” None of us knew. ‘Til give you a clue,” 
he added. “It wasn’t Mr. Steth.” Then Dr. Coltart went on to tell us how one day in 
1816 a young doctor named René Laennec was walking through the Tuileries Garden 
in París. The case of one of his patients, a young woman suffering from a heart 
condition, was playing on his mind. He had been unable to learn anything about 
her problem from the accounts she gave. In those days doctors used the patients’ 
description of their symptoms as the basis for diagnosis, so Laénnec’s inability to 
glean anything from the young woman’s account represented a serious problem. 
While he was wondering what to do, Laennec saw a group of children playing 
around a log that was sitting on top of a pile of trash. The children at one end of the 
log were pressing their ears to the wood. They seemed to be able to hear the knocks 
and scratches made by the children at the other end. This was Laénnec’s “Eureka!” 
moment. He returned immediately to his patient at the Necker Hospital. Rolling a 
book into a tight cylinder so that it resembled a log, he pressed it to her heart and 
found he could hear her heartbeat clearly. 
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I heard Dr. Coltart repeat this story to other groups of students, and, indeed, it 
can be found in several texts (e.g., Marks 1972; Welsby, Parry, and Smith 2003). 
Others writing on the invention of the stethoscope are less specific as to what 
took place. Reiser States that Laénnec simply “recalled the well-known acoustic 
phenomenon: that sound was augmented when it traveled through solid bodies, 
as when a scratch noise made at one end of a piece of wood can be heard at the 
other end” (1997, 828). He does not specify what caused Laénnec to make this 
recollection. Fleming goes as far as to declare that <c [t]here is . . . no documentary 
evidence to support the attractive traditional story that it was the sight of children 
at play, scratching one end of a log of wood and listening at the other in the 
courtyard of the Louvre, which first gave Laénnec the idea of medíate auscultation” 
(1997, 88). However, while Fleming aims to discredit the myth, he in fact only 
reproduces the “attractive traditional story” by referring to it in such detail. 
Despite a lack of substantiating evidence and even counterclaims or suggestions 
that the stethoscope might have been invented in ancient Egypt (e.g., Rackrow 
2009, Martinet et al. 1998), the story of Laénnec’s invention of the stethoscope was 
well established in the clinical setting in which I conducted my research. It was 
being passed on, reproduced in new generations of doctors. The story was an 
accepted and generally unscrutinized creation myth for a now ubiquitous piece of 
medical technology. 

Although Laénnec is widely credited with the invention of the stethoscope, 
medical practitioners have evidently used the sounds of the body as clues to the 
health of its interior for centuries. Mangione indicates that references are made to 
the diagnostic significance of breath sounds in the Ebers papyrus (c. 1500 bc) and 
the Hindú Vedas (c. 1400-1200 bc) (2000,295). InthefourthcenturyBC,Hippocrates 
described what carne to be known as “immediate auscultation,” which involved 
pressing an ear to the patient’s chest in order to listen to the internal sounds 
(Mangione 2000, 295; Marks 1972, 19-20). Doctors continued to employ this tech- 
nique as the centuries progressed, and it was still in use in the early nineteenth 
century. Fleming (1997) and Lachmund (1999) argüe that the practice was not com- 
pletely abandoned until much later. In the medical community in which my 
research took place, Hippocrates was widely credited with being the first to recog- 
nize and teach listening as a way of discerning the health of the interior organs. 
However, Reiser suggests that it was Robert Hooke who first grasped the huge scope 
and diagnostic potential of auscultation: 

There may be ... a Possibility of discovering the Internal Motions and Actions of 
Bodies by the sound they make, who knows but that as in a Watch we may hear 
the beating of the Balance, and the running of the Wheels, and the striking of the 
Hammers and the grating of the Teeth, and Multitudes of other Noises; who 
knows, I say, but that it may be possible to discover the Motions of the Internal 
Parts of Bodies, whether Animal, Vegetable or Mineral, by the sound they make, 
that one may discover the Works perform’d in the several Offices and Shops of a 
Man’s body, and thereby discover what Instrument or Engine is out of order. 

(Hooke 1705,39-40). 
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The idea that the “Motions of the Internal parts or Bodies” might be discerned by 
their sounds suggests that the actions and movements of each organ could be 
rendered intelligible to the ear, producing a rich and detailed source of diagnostic 
information. 

A further technique for the use of sounds in diagnosis is thought to have 
emerged in 1761, when Leopold Auenbrugger hit upon the idea of percussion. 
He was an innkeeper’s son and as a child had learned to test the fullness of 
barréis by thumping them (Porter 1997, 256). Switching kegs for rib cages, 
Auenbrugger noted that, if struck with a finger, healthy and unhealthy chests pro- 
duced different sounds: tc a healthy chest sounded like a cloth-covered drum ... a 
muffled sound or one of high pitch indicated pulmonary disease” (Porter 1997, 
256). Porter writes that Auenbrugger’s work attracted little attention when it 
first appeared. It was not until after the charismatic physician Jean Corvisart 
published a French translation of it in 1808 and began to apply percussion himself 
that the technique gained general acceptance and began to be used routinely 
(Porter 1997, 308). 

Percussion remains an important technique in the clinical examination. The 
students with whom I studied were taught that, as Auenbrugger had attempted to 
demónstrate, percussive sounds are resonant over healthy, aerated lungs and dull in 
spaces where there is fluid. They would tap the chest using the middle finger of one 
hand to knock on the second phalange of the middle finger of the other. As part of 
the chest examination students were taught to percuss the lungs by tapping at an 
upper, middle, and lower point on each side of the patient’s back and on top of the 
shoulder. Percussion was also applied to the abdomen to determine whether 
abdominal distensión, if present, involved solids or liquids (producing dull sounds) 
or gas (producing resonant ones). 

René Laénnec apparently knew of and admired Auenbrugger’s work on percus- 
sion. He recognized, however, that “we frequently stand in need of a more constant 
and certain sign than that furnished by percussion” (Laénnec 1846, 3). Laénnec was 
also familiar with immediate auscultation, but he considered the technique unsat- 
isfactory in many ways. Because it involved direct contact between the ear and the 
patient’s chest, its best use in his own experience was as a tactile means of discern- 
ing the pulsations of the heart. Little could actually be heard with any clarity. 
Laénnec also felt it to be “alike inconvenient to the physician and the patient; 
its disagreeableness alone often renders it almost impracticable in hospitals” (Porter 
19 97, 2). Sterne gives careful attention to the cultural dynamics of disgust at the time 
Laénnec was practicing, but it is easy to understand that, at a time when many 
patients would have arrived at the Necker hospital in the advanced stages of 
tuberculosis, pressing one’s ear to the chest may have been an unpleasant task 
(2003a, 115-17). Laénnec also wrote of immediate auscultation that “it can hardly 
be proposed to females in general, and in some the large size of the mammae pres- 
ents an insuperable obstacle to its adoption” (1846, 2). Not only was immediate 
auscultation often impracticable on women; for a Frenchman in the early nineteenth 
century it required an unacceptable level of intimacy. 
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As indicated earlier, it is unclear exactly where Laénnec’s inspiration for the 
stethoscope carne from. Though Sterne argües that he may well have crafted the 
story with the benefit of hindsight, Laénnec relates how, sitting at the bedside of a 
young female patient whose gender and obesity made immediate auscultation use- 
less, he rolled a small stack of paper into a cylinder and pressed it to his patient’s 
heart (Sterne 2003a, 102). He was pleasantly surprised to find how well his first 
impromptu stethoscope worked: “I . . . was not a little surprised and pleased, to find 
that I could thereby perceive the action of the heart in a manner much more clear 
and distinct than I had ever been able to do by the immediate application of the ear” 
(Laénnec, cited in Reiser 1997, 828-29). Listening with what he later called the 
“stethoscope,” “pectriloque,” or “cylinder” became known as V auscultation medíate : 
auscultation mediated by an instrument (Lachmund 1999, 424). 

Laénnec’s method in developing auscultation was to listen to the chests and 
particularly the lungs of patients admitted to his hospital, carefully noting what 
he heard. Many of these patients were suffering from tuberculosis, and Laénnec 
would continué to listen as the disease progressed. When the patient died, he would 
conduct a postmortem, systematically referencing the sounds he had heard to 
pathological changes found during dissection (Porter 1997, 263). Laénnec began to 
identify acoustic signs that he believed were characteristic and therefore diagnostic 
of tuberculosis. Ultimately, Laénnec’s approach enabled him to identify the pres- 
ence of abnormalities in the bodies of patients who were still alive. It was no longer 
necessary for patients to die (though they very frequently did) in order for anato- 
mization to take place. Auscultation created scope for the “autopsy of the living” 
(Sterne 2003b). In this respect, Reiser States that auscultation marked “a new age in 
diagnosis” (1997, 831). 

In describing the adoption of stethoscopic listening by other practitioners it is 
important to avoid reproducing what Latour describes as “diffusionist” narratives, 
whereby instruments spread through society as if by their own volition or as though 
moving through a static médium (Latour 1987, 136). Ñor was it simply a question 
of “technology transfer,” by which the stethoscope was taken to other countries. 
As Maulitz, referring to the introduction of the stethoscope to England, points 
out, “[t]he process was rather one in which experience , from the dissection table 
and the hospital wards, flowed through the careers of multitudinous young 
Englishmen as they made the journey out [to Paris] and back” (1987, 136; original 
emphasis). Auscultation as a skill was carried between communities of medical 
practitioners. Also, Furst writes that “it would be erroneous to jump to the conclu- 
sión that the stethoscope was rapidly hailed and assimilated into general practice.” 
Instead, the “adoption of the stethoscope was patchy” (1998, 57). Lachmund (2009) 
and Nicolson (2004) indícate that Laénnec’s ideas were variously accepted, con- 
tested and subjected to reevaluation by communities of doctors in major centres 
of medical practice across Europe. At the same time, though auscultation was 
eventually widely adopted it has never been an entirely homogenous or completely 
standardized practice. I recognize that in what follows, I am describing some of 
the specifics of auscultation as taught to students at a particular socio-historical 
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juncture — a teaching hospital in London between 2003 and 2004. As I go on to 
explain, the technological sophistication of medicine at this time has had important 
implications for the way in which auscultation is taught and learned, and for the 
valué which is assigned to the practice. 


The Body as a Soundscape 


Of the methods of listening just described, all except immediate auscultation have 
retained a place in the repertoire of skills and techniques taught to medical students 
at the time my fieldwork was conducted. These techniques, and in particular medi- 
ate auscultation, allow the body to be constructed as a dynamic acoustic space. 
Schafer (1977) is credited with having coined the term soundscape to refer to the 
specific acoustic character of a space or place , but it is clear that the body, too, has 
a soundscape of its own. It is a “sounding cavity” (Gell 1995, 240). It “sounds and 
resounds” (Rée 1999, 33). 

Some psychoacousticians engage with sound originating within the body as an 
acoustic problem. Barany (1938) refers to the noise produced by the flow of air, 
blood, and the creaking of joints and muscles attached to the skull. Tonndorf writes 
that “trans-lational” waves of sound may be conducted through the body from 
within (1972, 233). It is argued that the ear filters out or minimizes these sounds. 
Thus, for instance, the auditory ossicles (the malleus , incus , and stapes) are sepa- 
rated by fluid-filled membranes that reduce bone-conducted vibration. Sounds 
from inside the body are screened out so as to make the outside world audible. 
Ackerman suggests that, were this not to occur, even the sound of a person’s own 
blood flow “would be as deafening as sitting in a lawn chair next to a waterfall” 
(1990, 189). Using the same watery metaphor, Carpenter and McLuhan suggest that 
the ears filter out “the continuous Niagara of sound . . . in the circulation of the 
blood” (1960, 68; see Helmreich, this volume, on water sounds). 

Both Ackerman and Carpenter and McLuhan imagine the dominant sonic 
characteristic of the body to be one of “flow.” Perceived through the stethoscope, 
too, the soundspace of the body is characterized by flow and by recurrent patterns 
of movement: of blood around the body and through the vessels and chambers of 
the heart, of breath in and out of the lungs, of matter and gases through the gut. Of 
course, auscultation also allows the detection of sounds produced by, for instance, 
contact or friction in the body. A pleural rub, for example, which can occur in 
patients with pneumonia, is created when thickened, roughened surfaces rub 
together as the lungs expand and contract. The sounds that constitute the heartbeat 
are made by the snapping closure of the heart valves. Broadly speaking, however, 
as Sterne suggests, auscultation creates a “hydraulic hermeneutics, charting the 
motions of liquids and gases through the body” (2003b, 203). Feld (1996) describes 
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how the Kaluli of Papua New Guinea have a specialized knowledge of the sonic 
space created by waterways flowing through their (visually inaccessible) rainforest 
environment. There is a sense in which, like the Kaluli, doctors have a specialized 
sonic knowledge, an “acoustemology” of patterns of flow through the visually 
obscured space of the bodily interior (Feld 1996). 

Flowing blood describes the corporeal surfaces and spaces across and through 
which it moves. Turbulence and variation in velocity create an audible hemody- 
namics that can convey detailed information about the efficiency and condition of, 
for instance, the heart valves. Anthropologists have detailed South American 
contexts in which the blood is said to “speak” directly to the healer, revealing the 
condition and needs of the heart (Nash 1967). The healer “hears” it or “listens to 
what it wants” (Nash 1967, 132-33). The blood “speaks” or “talks” (Tedlock 1982). 
Reiser writes that when stethoscopic listening was new to medicine, doctors 
were concerned that they might appear ridiculous because they behaved “[a]s ifthe 
disease itself were a living being that could communicate its condition” (1978, 37). 
On several occasions I heard teaching doctors who had asked students to auscúl- 
tate and give a diagnosis exclaim that a particular sound was “screaming” or 
“shouting” the correct answer — as though it was actively communicating. But these 
colloquialisms, which effectively personifed the body sounds and lent them an 
intentioned voice, represented a delibérate departure by teaching doctors from a 
formal medical discourse whereby the sounds of the body were framed more 
neutrally as incidental by-products of corporeal processes and events. 

The heart’s most obvious “sound event” is the heartbeat (Schafer 1977). Most 
people know this sound through its being at the edge of their awareness of their 
own bodies (Rée 1999, 51; Ackerman 1990, 178). However, recordings of the 
heartbeat are also frequently used in popular music and televisión and film sound 
tracks. Pink Floyd’s The Dark Side of the Moon , for example, begins and ends 
with the heartbeat, while the Minimoog synthesizer has a chart showing the neces- 
sary oscillator configurations required to produce versions of this sound. In film 
sound tracks a gradual speeding up of the heartbeat is often used to build tensión 
or expectancy, while in medical dramas it is employed to evoke a sense of jeo- 
pardy — of a patient’s life hanging in the balance, for instance. The heartbeat has 
become a sonic icón of human life. However, auscultation requires the listener to 
internalize a new semiotics of the heartbeat whereby its character and constituent 
sounds become diagnostically associated with physiological States and events. Like 
cars as described by Krebs (this volume), hearts and bodies more generally are posi- 
tioned as mechanisms in which a range of problems and inefficiencies are articu- 
lated through sound. At the same time, like car mechanics, medical students must 
develop diagnostic listening skills over the course of their professional apprentice- 
ship. However, whereas for the trainee mechanics Krebs describes diagnostic listen- 
ing skills might be acquired through tacit learning, for the medical students 
with whom I studied these skills became the focus of overt and targeted learning 


exercises. 
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Learning to Listen 


At the hospital, Dr. Coltart, my supervisor, suggested I attend the classes at which 
he introduced a group of third-year medical students to auscultation. The time 
taken to qualify as a doctor in Britain varíes depending on a persoiTs route through 
the education system, but the students in whose classes I became a participant 
observer expected their medical training to take five years in total. For the first two 
years, the preclinical years, they had learned through lectures and tutorials; now 
they were embarking on the first of three clinical years that would involve contact 
with actual patients. This year’s practical teaching was subdivided into three 
“rotations,” each lasting for three months and focusing on a specific area of 
the body: the abdomen, head or chest. Dr. Coltart’s classes were part of the chest 
rotation, which included cardiovascular and respiratory medicine. Owing to 
Dr. Coltart’s specialization in cardiology, heart sounds became the particular focus 
of attention during his classes. 

Auscultation of the heart tends to be used in the context of a more general 
cardiovascular examination that usually ineludes the taking of a clinical history, 
a series of careful observations from the foot of the bed, and the feeling of pulses, 
as well as other small checks and procedures, all of which shape the listener’s expec- 
tations of what will be heard. Indeed, The Oxford Handbook of Clinical Medicine 
States that auscultation is: 

generally, but wrongly, held to be the essence of cardiovascular medicine at the 
bedside. A caricature of cardiology ward rounds is of the anxious júnior gabbling 
through the history, while noting his chief s fingers twisting his stethoscope, 
impatient to “get down to the main business” of listening to the heart — thereby 
blotting out all talk in favour of a few blissful minutes communing with the “lub” 
and the “dub.” This is absurd . . . if you spend time listening to the history and 
feeling pulses, auscultation should hold few surprises: you will often already 
know the diagnosis. (Longmore, Wilkinson, and Torok 2001, 39) 

Stethoscopic listening, then, is rarely used on its own but fits within a web of 
interconnected techniques for examining patients. Signs detected by means of the 
technique often confirm diagnoses formulated on the basis of other observations. 

However, while auscultation forms just part of a repertoire of diagnostic prac- 
tices and techniques available to the doctor, at St. Tilomas’ it was also regarded as 
an important skill in its own right. This, I suggest, is not only an Índex of the valué 
of sounds as diagnostic signs but also a reflection of the symbolic importance of 
auscultation. As indicated earlier, the act of placing the diaphragm on the patient’s 
chest might be described as the archetypal gesture of modern clinical expertise. It 
articulates the doctor’s power, knowledge, and skill relative to the patient (Fíansen 
1997, 78; Rice 2010a). The stethoscope is the doctor’s “symbol of office,” and listen- 
ing provides an opportunity for doctors to express what, following Bourdieu, might 
be described as their key professional “dispositions”: concern, responsibility, and so 
on (Marks 1972, 16; Bourdieu 1980, 54). Furthermore, the chest and especially the 
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heart are popularly regarded in the West as the seat of life. A person listening to the 
heart and having medical power in relation to it is accorded particular prestige 
among both medical professionals and patients. 

At the first chest rotation session I attended, Dr. Coltart began explaining that 
the heartbeat is made up of two main sounds. These are known as the first and the 
second heart sounds (often abbreviated as Si and S2) or simply the “lub dub” 
(I have also heard them referred to as the a lub dup” and cc lup dup”). The first heart 
sound, or the “lub,” is caused by the closure of the mitral and tricuspid valves. The 
second heart sound, or the “dub,” by the closure of the aortic and pulmonary valves 
(see figure 12.1). 

In addition, Dr. Coltart explained that, in a normal heart, blood flows around 
the heart and through the valves smoothly, so that the only sounds that can be 
heard through a stethoscope are the closing snaps of the valves that produce the 
“lub” and the “dub.” Sometimes, though, a physiological abnormality can mean the 
blood flow becomes turbulent, and this turbulence creates what are known as “heart 
murmurs.” For instance, if a valve becomes stiff (perhaps through calcification) so 
that blood is forced through a narrower opening than is normally made by a valve 
(this restriction of blood flow is known as stenosis ), turbulence will be created and 
a murmur produced. If the valve ceases to cióse properly or becomes floppy through 
tissue damage, blood will often flow back through it (this is known as regurgitation ), 



Inferior 
vena cava 


Figure 12.1 A simple diagram of the heart. Reproduced with permission of the 

Texas Heart Institute (www.texasheart.org). 
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and a murmur will again be produced, though at a different stage in the cardiac 
cycle and with a different sound. Valve problems are not the only causes of mur- 
murs. Holes in the septum (the muscle wall separating the two sides of the heart), 
for instance, can cause large volumes of blood to be forced back and forth between 
the chambers, producing a murmur known as a shunt Some murmurs appear to 
have no physiological basis at all and are caused by unexplained turbulence in the 
blood flow. These are known as “innocent” or “flow” murmurs. In general, though, 
heart murmurs are linked to malfunctioning valves. 

By indicating the manner in which they correspond to the underlying physiol- 
ogy of the heart, Dr. Coltart set out a basic interpretative framework for heart 
murmurs. He worked on the assumption that auscultation serves to “give light to 
the bodily interior” (Draper 2002, 777). It allows the doctor to visualize what is 
taking place inside the heart, drawing on an existing “body” of knowledge gained 
through dissection (Foucault 1973). Listening acts as an extensión of the clinical 
gaze, so that the “murmur of mitral regurgitation,” for example, might be described 
as a “quasi-visual depiction” of a malfunctioning valve (Lachmund 1999, 432). 
Despite the closely interconnected nature of auscultation with forms of anatomi- 
cally orientated clinical gazing, however, the technique demands the practical 
acquisition of auditory skill, and after his introduction to the theoretical underpin- 
nings of the auscultation of the heart, Dr. Coltart told us that most of our time with 
him would be spent actually learning to hear and identify murmurs. 


An “Ears On” Approach 


Describing the trips made by English doctors to study auscultation in Paris, Maulitz 
writes: “The year in France was, far from a passive period of observation, a veritable 
tour de mairf (1987, 136; italics in original). The notion of a “ tour de main ” captures the 
distinctly practical and skill-orientated nature of the education the doctors received. 
The auditory training auscultation requires means this period of education must often 
have also been a tour d’oreille. For the students with whom I worked, stethoscopic 
listening certainly involved an “ears on” approach (Lachmund 1999, 440). 

Sessions for the chest rotations took place on the cardiothoracic wards. Dressed 
in white coats and clutching our notebooks and stethoscopes we (the students 
and I) would follow Dr. Coltart to the bedside of a patient he judged to be a suitable 
case study for us. We would form a semicircle around the bed, with Dr. Coltart at 
the head on the patient’s immediate left. He would then introduce us to the patient 
briefly and tacitly draw our attention to the importance of being pleasant and polite 
in professional interactions. With the patient’s consent he would then demónstrate 
the cardiovascular examination, going through each stage and drawing our atten- 
tion to any relevant diagnostic signs. Among the checks involved were inspection of 
the patient’s hands and fingers, the taking of pulses in the wrist and neck, noting the 
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coloration of the cornea and condition of the teeth, and palpation of the chest for 
abnormal vibrations. 

After demonstrating these examination stages, Dr. Coltart would proceed to 
auscultation; listening quickly and with well-practiced skill he indicated the key 
points on the chest at which sounds produced by the heart valves are most easily 
discerned. In doing so he essentially produced in our minds a map of the surface of 
the chest, where each point indexed an underlying valve. Having given this demon- 
stration, he would instruct each of us to auscúltate in turn. Like the other students, 
I found it was initially very difficult to make any assessment of the sounds detected. 
I was unsure what to listen for, and fumbling with or misplacing the stethoscope 
often made it impossible to hear anything apart from vague and confusing or 
sudden, sometimes painfully loud, sounds. 

Correcting our listening postures (“You need to keep your head down and for- 
ward”) and our handling of the device (“You’ ve got your earpieces in the wrong way 
around”), Dr. Coltart advised us to listen first of all for the “landmark” sounds, namely 
the “lub” and “dub” that constitute the heartbeat. He suggested we nod our heads with 
each beat or tap out the rhythm with a foot, lending the sounds a kind of kinesthetic 
amplification. He also instructed us to listen while feeling the patient’s pulse not only 
to anchor the sounds of the heart in tactile sensation but also to check for delays between 
the heartbeat and the rise of the pulse as these could be of clinical significance. 

As we became more confident at detecting the first and second heart sounds, 
Dr. Coltart instructed us to begin to listen for heart murmurs and other abnormal 
heart sounds. He suggested we employ a kind of temporal system of focus, spending 
a few seconds listening to the sound as a whole, then another few seconds concen- 
trating on the “lub” and “dub,” then a further few seconds listening for additional 
sounds. He also advised us to mark the first and second heart sounds in our heads 
using imaginary lines. If additional sounds were present, we should shade in the 
spaces on either side of the first or second heart sounds in which they occurred. From 
this visualization we would be able to work out the position of the murmur relative 
to the cardiac cycle. Many students found this technique of visualization helpful, but 
Dr. Coltart was eager for us to become so familiar with different kinds of murmurs 
that we would cease to need to visualize them; instead there would be an automatic 
auditory recognition. 

In addition, Dr. Coltart emphasized that our ability to auscúltate would impro ve 
only with practice. He suggested that we revisit patients whose murmurs we had 
found tricky and go around other wards asking the nurses in charge if they knew of 
patients in their care with interesting murmurs to which we might listen. St Tilomas’ 
was a teaching hospital, which meant that patients had been alerted to the impor- 
tance of education in the institution, and students could quickly gain access to them. 
There was almost invariably a range of patients with different heart sounds for the 
students to listen to, and these patients generally agreed to repeated examinations 
(for a discussion of patient involvement in auscultation classes see Rice 2008). 

The students with whom I worked considered auscultation to be a solitary, even 
isolating perceptual experience (Rice 2010b). This is partly because stethoscopic 
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listening, like the iPod listening described by Bull (this volume), is privatized. The 
sounds of the body are narrowcast — channeled through earpieces — not broadcast 
into public space in the way that music played over a stereo might be. The students 
could never be sure that they were hearing the sound Dr. Coltart wanted them to be 
hearing or whether they were making some mistake in their placement of the stetho- 
scope. Technologies have been developed to get around this problem. Teaching 
stethoscopes with as many as ten pairs of earpieces leading off a single central dia- 
phragm (ensuring that everyone is listening to the same sound) are available, and 
recently digital stethoscopes have been produced that allow heart sounds to be ampli- 
fied, recorded, and played back. However, while they are available to purchase, I did 
not see either of these devices in use at St. Thomas\ At the time of writing, the iPhone 
application iStethoscope Pro became available. This software allows heart sounds to 
be recorded and played back into shared auditory space via the iPhone’s speakers. 
Filters can be applied to make the desired frequencies of the heart sound clearer and 
to minimize background noise. The application may find use in the teaching of 
auscultation in the future precisely because of its capacity to make heart sounds easy 
to share between listeners. 

To get around the problem of the subjective isolation of listening, Dr. Coltart 
and his colleagues provided students with a heart murmur CD. Professionally pro- 
duced by a major stethoscope manufacturer, this set of recordings included voice- 
overs naming each individual heart sound and describing its particular sonic 
characteristics. The CD enabled the students to listen collectively during classes and 
ensured that they were hearing the intended sound. Whenever they detected subtle 
or more unusual sounds, they were also able to refer to the recorded examples as a 
kind of acoustic témplate. However, while the recorded sounds were helpful, they 
tended to be both louder and clearer than those the students were able to hear 
through their stethoscopes. The recordings also lacked the background noise of the 
wider ward, the tactile realism of the stethoscope-user interface, and the contextual 
detail of the doctor/patient interaction. Though after a few days the students were 
able to complete the heart sound quiz at the end of the CD, finding and recognizing 
the same murmurs in real life was more challenging and would be achieved only 
gradually by practicing on real patients. 

The teaching of auscultation also presented a problem of vocabulary. Standing 
at the bedside, Dr. Coltart often tried to describe heart sounds for his students. “It’s 
that long, low, rumbling sound I want you to be getting,” he might say, or “Can you 
hear that harsh, rasping, almost squeaking sound?” But the adjectives were usually 
inadequate and prone to misinterpretation. He also resorted to mimicry to capture 
the quality of murmurs. “It’s the ush, ush, ush I want you to listen to,” he might say, 
or the “lup-dup-shh, lub-dup-shh.” His use of these nonsense words often had a 
comic effect, particularly because of their contrast with the serious tone and formal 
terminology that generally characterized clinical discussions. 

The problem of providing reliable descriptions of sounds evidently preoccu- 
pied René Laénnec. He employed a range of símiles, describing one lung sound, for 
example, as “a tinkling similar to that of a small bell just ceasing to ring, or of a fly 
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buzzing in a china vase” (1846, 320). Others resembled, for instance, snoring, the 
cooing of a wood pigeon, or the rubbing of a bass string (Marks 1972, 71-72). 
Lachmund (1999) writes that Skoda, a key figure in the development of the “Viennese 
approach” to auscultation, also made use of analogies between body sounds and 
acoustic experiences from the everyday soundscape, though to a lesser extent than 
Laénnec. 

Both Laénnec and Skoda also struggled to produce functional classification 
systems for the lung sounds they detected. Lachmund (1999) describes how Skoda 
conscientiously critiqued and reworked an (o ver) elabórate and fine-grained taxon- 
omy put forward by Laénnec, producing a simpler and more straightforward system 
of his own. In the taxonomy of heart murmurs used and taught at St. Thomas’s, the 
sounds were described by reference to four parameters: tone, volume (on a scale of 
I to VI), place in the cardiac cycle, and location on the chest. Thus, it became possible 
to refer, for instance, to “a loud, grade IV, ejection-systolic murmur in the aortic 
area.” A murmur meeting these criteria would usually indícate aortic stenosis. Use of 
this system was evident in numerous referral letters, suggesting that it had attained a 
high degree of standardization in communities of clinical practice in Britain. 

This system of classifying heart sounds did not mean that there were no longer 
difficulties in establishing consensus on the nature and significance of heard sounds. 
Auscultation is inevitably interpretive or hermeneutic. Findings made by using the 
technique are contestable. The Oxford Handbook of Clinical Medicine States that 
a [t]he first and second sounds are usually clear”; however, CÍ [c]onfident pronounce- 
ments about the other sounds and soft murmurs may be difficult. Even sénior 
colleagues disagree with one another about the more difficult murmurs” (Longmore, 
Wilkinson, and Torok 2001, 80). During my fieldwork I saw a number of instances 
in which a doctor would have his assessment of a heart murmur corrected by a 
more sénior member of his team or in which a third doctor was brought in to offer 
an opinión on a murmur whose significance two other doctors could not agree on. 
Importantly, however, systems for communicating about and classifying body 
sounds have evidently been good enough to make auscultation viable and applicable 
as a diagnostic technique in a range of Western medical contexts. By developing 
their practical and sensory understanding of auscultation alongside a more techni- 
cal appreciation of the structure and significance of heart murmurs, the students 
with whom I worked gradually increased their proficiency at detecting and inter- 
preting sounds. They became more effective participants in and bearers of what 
Sterne calis “medicine’s acoustic culture” (Sterne 2003b). 


A Dying Art? 


In the medical culture in which my fieldwork took place the stethoscope was 
considered by some to represent something of a throwback to the “oíd world.” 
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Though still in routine use, a number of doctors referred to the instrument as 
ÍC a dying technology” or to auscultation as cc a dying art.” The difficulties of consis- 
tent interpretation and reliable communication over heart sounds (as indicated 
earlier, a problem throughout the history of the use of the technique) were consid- 
ered by some to be unacceptable flaws. Though in the right hands (and ears) a 
highly sensitive instrument, the stethoscope was felt by some to have reached its 
diagnostic limits. Investigations could be done with greater thoroughness and accu- 
racy and with less scope for interpretive variation by more powerful technologies. 
Echocardiography, for instance, also known as cardiac ultrasound, can be used to 
produce detailed images of the interior of the heart, showing the valves and making 
their functioning clearly visible in real time. It can also create images of hemody- 
namic flow and allow the thickness of the heart’s muscle walls to be measured with 
considerable accuracy. 

The students with whom I worked knew they would be obliged to demónstrate 
a degree of skill in auscultation in order to pass exams that would allow them to 
progress through medical school and eventually qualify. But they were also con- 
scious of the diagnostic power of echocardiography. They could imagine that, once 
qualified, listening would be necessary in making an initial assessment of whether a 
patient’s heart was normal. However, they felt that, should any abnormality be 
detected, they would almost certainly be expected to send the patient for an echocar- 
diogram. They suspected they might never have to make fine distinctions between 
subtly different sounds. Auscultation might effectively become a form of triage 
for them. 

During fieldwork I attended several teaching sessions with a young woman 
named Zani, a doctor from Pakistán who was attending St. Thomas’ in order to 
qualify to practice in Britain. She pointed out that in Pakistán many hospitals are 
comparatively underresourced and tend to be less technologically sophisticated 
than those in the West. Echocardiography was not widely available and was too 
expensive for routine use. As a consequence, auscultation was considered to be a 
highly important component of both medical training and practice. Zani believed 
the training in stethoscopic listening she had received in Pakistán was more rigor- 
ous than that undergone by most students in Britain. She also felt it was important 
that she be self-assured and confident in her listening skills. Clearly the emphasis 
placed on expertise in stethoscopic listening varíes considerably according to cul- 
tural lócale. I noticed that Dr. Coltart told several groups of students about the time 
he had spent at Stanford University, where he had had a visiting professorship. He 
related how at this prestigious medical school he had impressed his colleagues with 
his bedside diagnostic abilities and in particular his skill at auscultation. Physicians 
there, he said, did not receive such extensive training in stethoscopic listening as 
those in the UK. They were generally more heavily dependent on test results than 
their counterparts in Britain. His remarks resonate with comments in a number of 
written sources (Mangione and Nieman 1997; Kirsch 1998; Babu 1999). 

It is easy to see that in auscultation, the “network” of relationships between people 
and technologies used in making a diagnosis is relatively simple (Latour 1987, 204). 
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The doctor “enlists” the stethoscope and engages his own skill, knowledge, and 
powers of judgment in examining a patient, sometimes also conferring with other 
docto rs (Latour 1987, 178). In echocardiography, a wider network of machines and 
technicians becomes involved in diagnostic decisión making. The reconfiguration 
of people and technology this wider network requires may be resource intensive 
and expensive to opérate, only becoming viable in comparatively wealthy settings. 
Where it is mobilized, however, diagnostic agency is dispersed, and skill sets 
are reconfigured. The capacity to auscúltate may no longer be demanded of the 
physician as a priority. 

Yet even in contexts where echocardiography is in routine use, it seems hasty to 
State categorically that the stethoscope is “dying” or that auscultation is ct a dying 
art.” Echocardiography and auscultation do not necessarily cover precisely the same 
diagnostic territory or have identical purposes. For instance, auscultation may be 
useful and expedient in the first phase of examination. It may provide a means of 
distinguishing serious problems from trivial ones. It may also offer an efficient 
means of repeatedly checking or monitoring the health of, for instance, a heart 
valve over time, while echocardiography creates more detailed diagnostic informa- 
tion where time and resources allow and is likely to be of particular valué where 
patients are being prepared for surgery or other procedures. For Dr. Coltart it was 
important that his students should be independent and resourceful and not rely too 
much on the wider networks of equipment and technical knowledge that echocar- 
diography requires. Where he determined the orientation of classes, auscultation 
was emphasized as a key skill and it seems unlikely that the students he was training 
will soon definitively pronounce the stethoscope dead. Indeed, they are more 
likely, through their own practice, to testiíy to its continuing life. Nonetheless, the 
students also indícate that auscultation has both a variable status and a valué 
that are contingent upon the availability (or otherwise) of more powerful medical 
technologies and those with the expertise to opérate them. 


The Auscultation of Everyone 


At every available opportunity Dr. Coltart told his students that they should listen 
to people’s chests. This meant not just hospital patients but people everywhere. He 
joked that even when we were sitting on the Tube (London’s rapid transit system) 
we should ask the person sitting next to us if it would be all right to listen to 
their heart. From many of its rooms St. Tilomas’ Hospital has remarkable views 
out over the River Thames, the Houses of Parliament, and Westminster Bridge. On 
one occasion Dr. Coltart made a sweeping gesture toward the bridge and asked us 
to imagine what would happen if we were to stop all the people walking across it 
and listen to their hearts: “Think of all those interesting sounds. There would be 
hundreds of people walking around with murmurs who didn’t even realize they 
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had them!” For Dr. Coltart the population of London could be understood as a vast 
collection of medical cases within and among whom lurked many murmurs. 

The group would often kill time between teaching sessions drinking coffee 
in the hospital café. After the session in which Dr. Coltart had suggested that 
we listen to all Londoners, we began to discuss how strange this thought was. 
Tom explained: 

You suddenly realize that they are inside everyone, these heart sounds . . . and the 
stethoscope is like a window to that. If I just walked around the Street and 
listened to everyone ... I would be bound to hear murmurs that people never 
knew they had, and suddenly they’ ve got a disease. Suddenly there’s something 
wrong with them. 

Harjit thought similarly: “I had never really thought about people in that way 
before. I could go around with my stethoscope, and I would pick up heart murmurs 
in people everywhere, not just in the hospital but in the Street.” A student named 
Dave described how, listening to a CD of heart murmurs as he traveled to the hos- 
pital by bus, he was surprised to find himself imposing the murmurs he was hearing 
onto people he saw in the Street as if their (unhealthy) heart sounds were amplified 
or he could hear them from a distance. Making the population at large subject to 
(albeit hypothetical) medical scrutiny, then, the students effectively collapsed any 
distinction between patients and nonpatients. Adopting a kind of stethoscopic 
macrofocus, they evoked the project of “surveillance medicine,” with “all persons . . . 
becoming patients” (Armstrong 1995, 397). 

In Medicine , Rationality, and Experience, Good explores the manner in which 
Harvard Medical School students are led to “think anatomically” through their 
training and how as a consequence they internalize “an alternative way of seeing” 
(1994, 73). Good suggests that the students carry and apply the gaze they acquire at 
medical school beyond the campus. After conducting dissections and autopsies, for 
instance, they begin to mentally anatomize the living people they encounter in 
everyday life. Similarly, the students with whom I studied found their acquired gaze 
and, in this particular instance, their acoustic perspective on the body applying 
beyond the hospital. 

Another of Dr. Coltart’s teaching strategies was to encourage his students to 
practice listening on the people cióse to them, as well as people in and outside the 
hospital. They should auscúltate their boyfriends and girlfriends, as well as their 
family members. This brought the stethoscopic focus to a domestic or family rather 
than a societal level. The change proved worrying and difficult. There was always a 
possibility that the students might hear something they didn’t want to. As Mary 
explained: 

When you’re in hospital, you want to hear murmurs. You want to hear loads of 
noises. But I was listening to my Dad’s chest, and I was thinking “Please, let there 
be just normal heart sounds, no murmurs.” I was worried because I know he 
smokes. And I was thinking “Come on Dad! Be normal! Be normal!” Then I was 
listening and I thought “Thank God I can’t hear anything! Thank God there are 
no murmurs.” 
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Rishi told me about the time he and his identical twin brother, who was also 
studying medicine in the same year, listened to their father’s chest: 

I remember my brother listening to my dad’s chest. He’s had an MI [myocardial 
infarction, commonly known as a “heart attack”] in *95. Anyway, my brother was 
listening to my dad’s chest, and he heard a diastolic murmur. I carne into the 
room, and I listened and heard it, too. My brother and me both agreed it was a 
diastolic murmur. It worried my Dad — enough to make him go to the GP 
[general practitioner]. 

Several other students found their family members were unwilling to undergo the 
examination at all. They were too concerned about what the stethoscope might 
reveal. The students were also uncomfortable putting cióse relatives under their 
clinical gaze. They disliked momentarily reformulating kinship relationships as 
doctor/patient relationships. They found they could not maintain the position of 
clinical/social distance and detachment that characterized their relationship to the 
people they saw during teaching sessions. 

Sinclair writes that the medical students he studied were generally encouraged 
to develop and test their anatomical knowledge by relating it to their own bodies 
(1997: 153). However, when it carne to auscultation they were told “Don’t listen to 
your own heart; it’s confusing!” (1997: 203). During my fieldwork Dr. Coltart actively 
encouraged us to listen to ourselves, a technique known as a autoauscultation.” He 
said it would give us a chance to familiarize ourselves with normal heart sounds 
(assuming our bodies would be clinically normal and reaffirming a dialectic between 
the “healthy” doctor or medical student and the “sick” patient) and would enable 
us to think through the relationship of the heart sounds to the cardiac cycle. 
Auscultation could be properly practiced only on a person , and as we ourselves were 
more readily available teaching material even than patients it made sense to listen 
to our own hearts. 

Still, bringing a stethoscopic focus to the individual self could be frightening; as 
Tom put it, “You just never know what you’re going to hear.” He continued: “My 
Dad has had a few heart problems, and Im a bit of a hypochondriac ... I don’t 
really want to turn this ear that Tm being trained to use on myself.” In fact, both 
Tom and another student named Alistair found they had quite bad sinus arrhyth- 
mia. Sinus arrhythmia can often be observed in young men. The heart tends to 
speed up perceptibly during inspiration and to slow down again during expiration. 
Although not a sign of disease or abnormality, it can be disconcerting, particularly 
if an entirely regular heart rhythm is expected. Harjit experienced a similar fear of 
what she might hear: “I carne back from class and was sitting at my desk, and 
I thought, ‘Wait, iVe never listened to my heart before.’ I was uncertain about 
whether I should or not. I was very scared that I might hear something bad.” None 
of the students I studied detected an abnormality, though it later became known 
that a female medical student in a different class had heard a murmur and become 
very distressed. She had gone crying to her teaching doctor and had never been 
entirely convinced by his reassurances that hers was a flow murmur and therefore 
“innocent.” It is easy to understand why she was reluctant to believe him given the 




316 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


strong emphasis placed on murmurs as indicators of valve disease. Autoauscultation 
then, and the diagnostic anatomization of ones self it involves, was responsible for 
cases of what Sinclair refers to as ‘medical student disease’, a form of hypochondria 
which frequently results from the student’s application of their clinical knowledge 
to their own bodies. 

For those who heard their heart sounds as normal, autoauscultation became a 
reassuring, even pleasant experience. Some students would try running on the spot 
while listening to themselves, seeing how fast they could make their hearts beat and 
how loud they could make them sound. Rishi’s friend Ambrose said that he would 
spend large amounts of time listening to his own heartbeat and would lie in bed, 
sometimes falling asleep with the stethoscope still in his ears. He found the 
sound comforting and speculated that this might be because the sound of one’s 
own heartbeat created acoustic conditions similar to those experienced in the 
womb. It is clear, though, that diagnostic techniques and the perspectives they 
create do not remain restricted to the hospital setting. Instead, they spill out of the 
clinical context and create informal perceptual engagements and interactions. 
Students encounter not only “the body,” unassociated with a specific owner or 
linked to a socially distant patient, but also socially cióse and familiar bodies through 
medical methods of perception. Turning the stethoscope back on themselves, they 
also encounter their own lived and experienced bodies — what Draper (2002) calis 
the “my body” — in new and potentially unsettling ways. 


Conclusión 


I have introduced the practice of cardiac auscultation as a specialized appreciation 
of the soundscape of the body and briefly described the manner in which medical 
students begin to learn the auscultation of the heart. The training in auscultation 
that medical students receive grants them a particular acoustic perspective on 
the sonic space of the bodily interior, amplifying it, bringing it to the center of 
their attention. Importantly, however, it is not only the patient body that is acousti- 
cally configured through auscultation. Medical students must also change their 
own bodies in the course of their training. They are obliged to learn new bodily 
techniques — types of practical sense and sensitivity — in order to become compe- 
tent auscultators. They must take on unfamiliar frames of auditory attention and 
analysis. 

Writing on fetal ultrasound, Draper (2002) describes the manner in which 
the introduction of this technology allowed women to see images of their unborn 
babies. These images were shared with the technician operating the scanner and 
could also be distributed to partners, family, friends, and so on. Draper argües 
that in this process the uniqueness of the woman’s pregnancy experience is 
disrupted. The fundamental axis of pregnancy shifts from one of haptic hexis 
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(characterized by notions of “touch,” “feeling,” and “being”) to an engagement 
with a visual image. I suggest that Draper slightly overstates her case; after all, the 
ultrasound sean does not erase the hapsis of pregnaney altogether. Nonetheless, she 
does make the important point that techniques of medical perception have the 
capacity to affect radically the way in which people engage with their own bodies. 
As the example of autoauscultation demonstrates, the stethoscope can create new 
senses of bodily awareness and possibility organized through sounds. At the same 
time, as was shown when the students began to listen to their friends and families, 
it can produce or exacérbate anxieties over what constitutes a healthy body and 
how a healthy body should sound. 

Historically, auscultation has been involved in the wider medical project 
of deconstructing the body by positioning its organs, tissues, and processes as 
“scientific work objeets” (Sharp 2000, 298). More recently, echocardiography might 
be construed as another means by which scientists technologically shape and 
manipúlate sound (albeit ultrasound) in order to propágate and reproduce this 
clinical-perceptual approach. At the beginning of this chapter I briefly described 
the soundscape of the hospital wards at the Edinburgh Infirmary. One feature was 
the electronic tones produced by medical equipment (in particular the electrocar- 
diograph). This sound was also very noticeable in the wards of the cardiothoracic 
unit at St. Thomas’ Hospital, where the monitoring of patients’ hearts was particu- 
larly intensive. Sound, then, plays an important role in medical projeets of surveil- 
lance, as well as anatomization. I would suggest that medical techniques like 
echocardiography and electrocardiography and potentially even tools such as the 
iStethoscope Pro described earlier, are dictating new chapters in the story of medi- 
cine^ sonic mediation of information about bodily spaces, surfaces, States, and 
events. Interestingly, an a ears on approach” will be crucial not only for medical 
students, but also for those involved in Science and technology studies as they seek 
to understand new acoustic technologies and their significance for the production 
of medical and scientific knowledge. 
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CHAPTER 13 


DO SIGNALS HAVE 

POLITICS? 
INSCRIBING 
ABILITIES IN 
COCHLEAR 
IMPLANTS 


Mara Mills 


Introduction: The 
Neural-Computer Interface 


In 1998 roboticist Hans Moravec published a millennial prediction about the immi- 
nent convergence of humans and machines — based on evidence from the evolution 
of sound technologies: 

In a few decades, people may spend more time linked than experiencing their 
dull immediate surroundings . . . Linked realities will routinely transcend the 
physical and sensory limitations of the “home” body. As those limitations 


I would like to thank Michael Chorost and Charles Graser for their generous feedback and 
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become more severe with age, we might compénsate by turning up a kind of 
volume control, as with a hearing aid. When hearing aids at any volume are 
insufficient, it is now possible to install electronic cochlear implants that 
stimulate auditory nerves directly. Similarly, on a grander scale, aging users of 
remóte bodies may opt to bypass atrophied muscles and dimmed senses and 
connect sensory and motor nerves directly to electronic interfaces. Direct neural 
interface would make most of the harness hardware unnecessary, along with 
sense organs and muscles, and indeed the bulk of the body. The home body 
might be lost, but remóte and virtual experiences could become more real 
than ever. (Moravec 2000, 169) 

Electroacoustics has been at the forefront of signal engineering and signal Process- 
ing since “the transducing 1870S,” when the development of the telephone marked 
the first successful conversión of a sensuous phenomenon (sound) into electrical 
form and back again (Hunt 1954, 37). By the second half of the twentieth century, 
acoustics research centers in the United States, such as Bell Telephone Laboratories, 
the Harvard Psychoacoustic Laboratory, and Bolt, Beranek and Newman, had made 
central contributions to the digital coding of signáis and to Computer networking; 
the “overarching themes,” as John Swets has argued, were “information processing 
and man-machine integration.” 1 

In 1984, after FDA (U.S. Food and Drug Administration) approval of the 3M 
cochlear implant for adults, neuroprosthetics entered the commercial sphere. 2 The 
cochlear implant (CI) delivers electrical signáis directly to the auditory nerve. With 
approximately 200,000 users today, these devices remain the most common neural- 
computer interfaces in the world. 3 The current technology ineludes an external micro- 
phone and a speech processor — a tiny Computer with software that can be upgraded. 
The processor variously transduces, samples, and codes environmental sound in order 
to transmit it to the auditory nerve through up to twenty-four electrodes, which 
functionally replace the thousands of hair cells in the inner ear. 

Concurrent with their entrenchment in futurist discourse, cochlear implants 
quickly entered the canons of bioethics and disability studies, raising questions about 
the definition of impairment, the feasibility of pediatric informed consent, and the 
cost-effectiveness of neuroprostheses. Many bioethicists have taken up the Deaf cul- 
ture or linguistic minority critique of implantation, which situates this technology in 
the long history of eugenicist attempts to promote oralism through the medical erad- 
ication of deafness and through pedagogical bans on sign language (Beard 1999; Berg, 
Alice, and Hurst 2005; Crouch 1997; Levy 2002; Sparrow 2005). Despite the promi- 
nence of the cochlear implant in disability studies, bioethics, and Science fiction, 
however, it has inspired little research in Science and technology studies (STS). Stuart 
Blume, a sociologist of Science and parent of a deaf child and a hard-of-hearing child, 
conducted the most substantial fieldwork in the 1990S on the reception of implants in 
France, the United States, England, Sweden, and the Netherlands (Blume 2000). 
Blume detected “two very different accounts of cochlear implantation”: 

One is a tale of medicine’s triumph, akin to many other such tales: a tale of 
courageous pioneers, of the wonders of medical Science and technology. 
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The other is in a genre which has emerged only in the past two decades and 
which highlights the subordination of medicine to surveillance, social control and 
normalisation. This a tale of the oppression of the deaf: of hearing society’s 
inability to accept deaf people for what they are. 4 

Yet more than one identity group is accommodated within the category of 
deafness. It ineludes members of Deaf culture, self-defined as linguistic minorities 
who sign (and whose “disability” is largely an effect of the built environment and 
social stigma); late-deafened adults, who tend to claim disability from hearing loss; 
and oral deaf and hard-of-hearing individuáis of all ages. 5 Without rejecting Blume’s 
assertions about the Deaf response to cochlear implants, it is possible to write a 
third history of this technology — a history that ineludes the active participation of 
late-deafened volunteers in research and development and at the same time depiets 
their distinctive patterns of stigmatization and exclusión. In The Artificial Ear: 
Cochlear Implants and the Culture of Deafness , published in 2010, Blume largely 
maintains his focus on Deaf culture and the debate surrounding pediatric implan- 
tation. He concludes, “The demands of Deaf community leaders and advocates had 
little or no effect either on development of the cochlear implant or on the begin- 
nings of local implantation practices. The experience of deaf people was not accepted 
as essential or even as relevant” (Blume 2010, 197). 6 

Reflecting on the participation of cochlear implant users (including himself) in 
triáis of sound-processing software, technology theorist Michael Chorost offers 
a different perspective on the relevance of deaf experience to the making of this 
technology: 

Even without being able to write code themselves, implant users do have a crucial 
impact on how the code is written. When engineers write new code, they have to 
test it on implant users to see if it helps them hear better. They also have to find 
out if implant users like it and can get used to it. To do that they need to recruit 
articúlate users and convince them to offer their time. It’s a highly collaborative 
process and is integral to how the field makes new advances. (Michael Chorost 
blog, comment posted January 5, 2006) 7 

In this chapter I follow some of the trajectories by which the autoexperiments, field 
notes, and laboratory tests of early users have left traces in the hardware, as well as 
the software, of cochlear implants. A species of “the co-construction of users 
and technologies” genre in STS, this chapter also considers the distant but intímate 
relations between lead and end users (Hippel 1986; Bijker 1995; Oudshoorn and 
Pinch 2003). 

On the one hand, the anatomy and phenomenology of experimental research 
participants exert a subtle influence on the experiences of users downstream. STS 
scholars have noted that “scripts” — defined by Madeleine Akrich as “the represen- 
tations of users” embedded within technology — often materialize during the research 
and development, clinical trial, or testing phases of technical development. 8 Designers 
do not simply “project” users into cochlear implants; from surgery to speech Pro- 
cessing, these devices are inscribed with the competencies, tolerances, desires, and 
psychoacoustics of early users. 
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On the other hand, the recommendations of test subjects have as often been 
expunged as built into cochlear implants. In what I am calling cross-purpose col- 
laboration , social norms, medical ideáis, and commercial interests have vied with 
the needs and preferences of deaf people in the construction of implant technology. 
In 1980 Langdon Winner asked whether artifacts had politics; for present-day 
electronic and digital media, politics can be found at the level of signáis. Specifically, 
CI signal processors embody a range of cultural and economic valúes, some of 
which are deliberately “scripted” into design, others of which accrete inadvertently. 
These scripts inelude the privileging of speech over music, direct speech over tele- 
communication, nontonal languages over tonal ones, quiet “listening situations” 
over noisy environments, and black-boxed over user-customizable technology. 9 All 
technical scripts are “ability scripts,” and as such they exelude or obstruct other 
capabilities. Due to the complexity and opacity of electronic technology, these con- 
straints often prove impossible for users to circumvent. The “home body” is thus 
not lost with this new médium; practices of listening are radically materialized. 
Users, moreover, experience their devices across the corporeal registers of hearing, 
visión, and tactility. 


Bionic Rhetoric 


According to Bonnie Tucker, a deaf legal scholar, “The hatred with which Deaf 
culturalists view cochlear implants is expressed in the ASL sign for the cochlear 
implant, which ineludes a two-fingered stab to the back of the neck, indicating a 
Vampire in the cochlea’ ” (Tucker 1998, 9). Although the sign language community 
is more diverse than is often acknowledged — and increasingly ineludes bicultural 
users of cochlear implants — existing animosity toward these devices derives from 
countless disappointments in Western Science and medicine, accumulated over the 
last two centuries. Harían Lañe, a hearing author who earned a MacArthur Award 
for his philosophical histories of signing Deaf culture, has detailed the repeated 
scientific victimization of deaf individuáis — from Jean-Marc Gaspard Itard’s 
application of leeches and electricity to his students 5 ears in the nineteenth century, 
to Alexander Graham BelLs prohibitions on deaf intermarriage, to Nazi steriliza- 
tions and executions (Lañe 1993). Harry Lang, a deaf professor at the Rochester 
Institute of Technology, tells of the more subtle losses that have attended scientific 
“progress” for deaf people: 

A glimpse into history also provides some understanding of why there is so much 
emotion attached to technological advances. “Advances” in voice telephony led 
to a ninety-year delay in access to the telephone for deaf people. “Advances” in 
adding the sound track to silent movies led to more than forty years of lost access 
to films. For hundreds of years, deaf people, viewed as “disabled,” have been 
treated with Chemical and electrical “cures,” sent up for airplane dives, and 
subjected to a multitude of other medical fixes. (Lang 2002, 91-92) 
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Equally sobering, cochlear implants have been correlated to sign language death 
and “cultural genocide,” especially since 1990, when the FDA approved them for 
children (Wrigley 1996; Ladd 1985). In 2004, Australian linguist Trevor Johnston, 
whose parents are deaf (and sign), published a demographic survey in the American 
Annals of the Deaf ( with the bittersweet title “W(h)ither the Deaf community?”), 
which predicted an end to Australian Sign Language (Auslan) within “half a life- 
time” as the result of “improved medical care, mainstreaming, cochlear implants, 
and genetic Science” (Johnston 2004, 370). Sign Language Studies dedicated its 
Winter 2006 issue to the international “comments” spurred by this article; authors 
reported comparable situations in Norway and — with a more gradual timeline — 
the United States. Teresa Blankmeyer Burke, professor at Gallaudet University, 
insisted upon the resilience of sign language and Deaf culture; she noted that if 
Johnston’s forecast turned out to be true, however, it would constitute a novel 
“instance of scientific progress directly threatening a linguistic community” (Burke 
2006, 175). 

Stuart Blume has examined the establishment of “clinical feasibility” and 
respectability for cochlear implants throughout the 1970S and 1980S. He concludes 
that most Deaf people were initially indifferent; their response “was quite unlike 
that of actual or potential AIDS patients, who (at least in the United States) stressed 
the right to earliest possible access to what might prove a life-saving drug” (Blume 
1997, 33). 10 Blume acknowledges that deafened individuáis “deluged” electroacous- 
tic and otological researchers with inquines; deafened advocates in fact urged phy- 
sicians in France, the United States, and Australia to develop the first implants and 
offered themselves for surgery (Blume 2010, 31, 33). Blume argües that physicians 
subsequently deployed “bionic rhetoric” and sensational performances by early 
volunteers to convince further test subjects that implants were beneficial. While 
Blume does not explain this ostensible drop-off in interest among deafened people, 
I suspect that the invasiveness of the experimental procedure and the limited 
definition of the first cochlear implants were deterrents, not to mention the out- 
spoken skepticism of many others in the medical community. Moreover, after the 
3M/House implant gained FDA approval in 1984, “the market grew far more slowly 
than had been anticipated,” an outcome due in part to the tremendous expense of 
the device (Blume 1997, 38; see also Blume 2010, 51; Zeng 2007). 

The rhetorical promotion of cochlear implants as “bionic ears” (as opposed to 
imperfect prostheses) unintentionally generated a counterrhetoric among Deaf activ- 
ists. In the 1990S, Blume concluded that the forcé of their counterrhetoric was such 
that this “stigmatized and relatively powerless group [became] a significant actor in 
the process of technological change” (Blume 1997, 46). The contributions of Deaf 
actors were, in Blume’s account, inhibitory — discouraging implantation rather than 
affecting the design and fabrication of these devices. However, with the global escala- 
tion of implant adoption (due to factors ranging from improved technology to low- 
ered age limits for legal implantation), Blume (2010) has reevaluated the protests of 
the Deaf community as disappointingly ineffectual. He registers a nearly impassable 
ethical predicament: When hearing parents choose implants and mainstream (oral) 
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education for their deaf children, they diminish the population of native signers. One 
solution, Blume suggests, entails “rewriting kinship” — rethinking the imperative of 
common ability within the family and imagining linguistic communities external to 
family or nation. 11 At the most basic level, some of this work could occur through 
changed counseling practices in otolaryngology clinics. 


Single Channel 


For the most part, the first implantations were conducted with late-deafened indi- 
viduáis; those born deaf were not considered good trial candidates because they 
could not compare the electronic “ear” to prior experiences of hearing. From the 
outset, a tensión has existed between normalizing therapeutics and the unexpected 
effects of the technology. Implants provide a novel mode of auditory perception, 
and, at once, they often socially disable those who wear them. Although cochlear 
implants seem to promise the replacement of a lost sense, by audiological standards 
implant users continué to have a hearing impairment. 

Direct electrical communication with the human auditory nerve dates to 1957, 
when Charles Eyriés and André Djourno implanted “Monsieur G.” at Flnstitut 
Prophylactique (Arthur Vernes) in París. The patient — an engineer — had lost hear- 
ing in both ears after a surgical procedure. 12 Eyriés was a practicing otolaryngolo- 
gist, Djourno a trained physician who had turned to basic research in medical 
electricity. Djourno studied electrocution and electroshock in animáis and cadav- 
ers, funded by Électricité de France, which was interested in potential countershock 
applications. To this end, he constructed a number of induction coil implants 
(“microbobinages”) for muscle and organ telestimulation. 

Earlier, in 1953, Djourno had met a deaf man who “had considered suicide” and 
tc said that it was utterly unbearable, this condition of no longer hearing ... he said: 
T would prefer any noise . . . even if it’s far from a real sound.’ ” Djourno stimulated 
the man’s ears temporarily with electrodes, and he heard a few sounds. In a pub- 
lished article soon thereafter, Djourno speculated that the implants he had designed 
might be appropriate for treating deafness. 13 

Djourno and Eyriés claimed that Monsieur G. had similarly “requested that the 
impossible should be tried in order to correct — even to the most limited degree — 
his total deafness. He was so insistent in his desire that we decided — in spite of the 
possibility of a failure, as we informed the patient — to embed an induced coil” 
(Djourno, Eyriés, and Vallancien 1937). 14 They implanted one of Djourno’s coils 
through the skull to the eighth nerve on February 25, 1957. During postoperative 
testing, Monsieur G. heard a number of high chirps and whistles (Seitz 2002, 81). 
He participated in “reeducation” four times a week for several months; when deliv- 
ered speech signáis through a microphone, he perceived low tones as the tearing of 
“jute cloth,” higher tones as that of a silk.” 
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Despite the incredible acoustic distortion, Djourno and Eyriés reported: 

He very much likes to have the apparatus working for the pleasure of hearing 
people come and go, slamming the door, or listening to conversation going on 
around him. Turning off the apparatus plunges him into a silence which he finds 
unbearable for a few minutes. In contrast, an unfortunate bump against the 
microphone results in a violent noise which deafens him for several seconds. 

(Djourno and Eyriés 1957, 1417) 

When the implant ceased functioning after some months, they repeated 
the procedure. This, too, failed, at which point Eyriés abandoned the project (see 
figure 13.1). 15 



Figure 13.1 The lower “microbobinage” is the implant, the upper is the (external) 
signaling coil, which attached to a microphone transmitter. Designed by Djourno 
(France, 1950S). Photograph courtesy of the John Q. Adams Center for the History of 

Otolaryngology, Head and Neck Surgery, Alexandria, Va. 
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Djourno performed one final implantation, collaborating with otolaryngolo- 
gist Roger Maspétiol. This second patient, N.T.L., was a Vietnamese girl who had 
lost her hearing after taking streptomycin for tuberculosis. She, however, was 
“reluctant” to undergo the surgery (“it was her father who had committed her to 
this operation”). 16 Afterward, she disliked the “low moaning sound” produced by 
her implant. She also disliked wearing the external transmitter, even after they 
attached it to a headband so her hair would conceal it. “She spoke impeccably,” 
Djourno commented, “yet complained a lot all the time because Indochinese men 
and women do not have the same accent or manner of speaking. She said Tt’s a 
catastrophe, because when I begin to speak, it sounds like a man. 5 ” 17 Over the 
course of two years N.T.L. visited the clinic only occasionally. When she and her 
family returned to Vietnam, they never contacted the doctors again. Djourno had 
by then begun planning a multichannel implant, based on the Fourier analysis 
seemingly performed by the human ear, but in 1959 the institute stopped funding 
this line of research (Seitz 2002, 84). 

From the outset, then, implants and their electroacoustic signáis had “politics.” 
Implantation began as a response to deafening — a condition at once physical and 
social — caused, in these first cases, as a by-product of modern medicine. Whereas 
the adult engineer voluntarily immersed himself in a world of strange and even 
painful sounds, this capacity for self-determination was denied to the child. The 
assistive device easily became a stigma Symbol, drawing attention to an otherwise 
invisible disability. Moreover, linguistic politics transferred instantly to this tech- 
nology for communication: Which sounds counted as linguistic? Flow did tones 
correlate to gendered social norms? Should language be prioritized over other kinds 
of sounds? These early experiments raised the question of whether auditory cues, 
no matter how uncanny or uncomfortable, were in fact preferable to “unbearable” 
silence. In an interview near the end of his life, Djourno acknowledged that implants 
were not appropriate for everyone: “There are deaf [people] who do not want to be 
equipped with devices, who are happy with sign language, lip reading; they consider 
devices as a bother ... I knew a family like that, [who used] a very complicated 
and perfected sign language, they fared much better than with any kind of electric 
stimulation . . . To impose a single solution on all the deaf, that’s nonsense.” 18 

In the United States, early volunteers also faced the difficult choice between 
impairment in sonic social settings, or surgical pain, atypical audition, and visible 
stigma. At the same time, a number of these volunteers were technical experts and 
enthusiasts. William House, who designed the 3M device (the first to be granted 
FDA approval), began his research in 1957, when a patient gave him a newsclipping 
about Djourno and Eyriés. 19 House and his brother owned a prívate ear institute in 
Hollywood, founded in 1946 by their father, Howard. 20 In early 1961 House collabo- 
rated with brothers John Doyle and James Doyle, a neurosurgeon and an engineer, 
respectively, to run preliminary tests on E.K., a deaf patient. House and the Doyles 
placed an electrode in E.K.’s inner ear, and the man — himself an engineer at a 
plastics plant — was able to describe the distinct sounds resulting from inputs 
such as pulses and square waves. 21 E.K. then tested a single-electrode implant for 
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three weeks, followed by an insert of five gold wires that had to be surgically removed 
after two weeks due to allergic reaction. 22 Through these bodily demonstrations, it 
became evident that wires required insulation to prevent electrode failure; gold, 
moreover, was not a tolerable material. 23 

John Doyle soon disclosed these experiments to the press, and House dissolved 
their partnership. 24 House recalls, “We began to be deluged by calis from people 
who had heard about the implant and its possibilities. The engineer who had con- 
structed the implant exercised bad judgment and encouraged newspaper articles 
about the research we were doing” (House and Urban 1973, 505). Space Age News 
was a particularly enthusiastic venue for these reports; by and large, cochlear 
implantation was unpopular with the medical establishment throughout the 
1960S. 25 

When Dr. F. Blair Simmons of Stanford entered into cochlear implant research 
shortly after House and the Doyles, Blume notes, “The American Otological Society 
rejected presentation of this work at their 1965 meeting, while an application for 
funding to the National Institutes of Health (NIH) was turned down” (Blume 1995, 
101—102). Simmons conducted an expío ratory multichannel implant surgery on a 
human volunteer in 1964, but then turned to animal research to investígate such 
factors as surgical approach and ideal number of electrodes. 26 Simmons’ move 
from clinical to basic research garnered respectability — and, by the 1970S, NIH 
funding — for further work on the electrical stimulation of the auditory nerve. 

Robín Michelson, who began his career in prívate practice and then moved to 
the University of California-San Francisco (UCSF), similarly became interested in 
the possibility of cochlear implantation through clinical work, which he combined 
with animal research in the midióos. Working with an engineer at Beckman 
Instruments, Michelson implanted several patients with single-channel devices 
around 1970 (Michelson 1971). He subsequently collaborated with a UCSF team 
that included Michael Merzenich and Robert Schindler to conduct basic research 
toward the development of multichannel implants (this research would lead to the 
Clarion model from Advanced Bionics). The UCSF group insisted that this animal 
research eventually be paired with psychoacoustic studies of human subjects. 
(Loeb et al. 1983, 252). 

According to a report written by Caroline Hannaway of the NIH Office of 
History, House resisted animal studies and continued to feel “that at some point 
risks had to be taken. He believed the reluctance of scientists to pursue work involv- 
ing human beings, and what he perceived as the greater readiness of granting agen- 
cies such as the NIH to support projects involving animal experimentation, to be a 
response to the Nuremberg triáis” (Hannaway 1996, 6). House worked in a clinical 
context rather than a university one, and his early research preceded many of the 
current laws regarding human experimentation, such as the requirement of ethical 
review. Extensive records of this research are available in the archives of the 
American Academy of Otolaryngology-Head and Neck Surgery, including rare 
first-hand accounts from test subjects regarding their auditory sensations, personal 
experiments, and proposals for future technology. House’s patient Charles Graser 
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received one of the first portable cochlear implants, a trial device assembled by elec- 
tronics technician Jack Urban. 27 Graser, who is still alive, has permitted me to quote 
from the journals of his field-tests. 

A high school social Science teacher and ham radio operator, Graser drove an 
oil rig in the summers to earn extra money to support his family. In 1959 his truck 
caught fire, and he was severely burned; over the course of several months in the 
hospital, he lost his hearing from the mycin drugs he was given. As a man in his 
forties, he had seen sign language only a few times; he once mentioned that he 
found it “beautiful,” but his family, friends, and employment moored him in the 
English-language world. As a patient at the House Ear Institute in 1961, he first 
learned of the implant experiments; that summer he wrote to William House and 
volunteered to enroll. 

House, at that point, was reluctant to proceed too quickly or to publicize his 
work. Graser wrote to him every six months until the end of the decade, inquiring 
about the possibilities for another experiment. Not until 1968, with advances in 
surgical plastics and miniaturized transistors — as well as the development of other 
medical electronics components in the growing field of artificial pacemakers — was 
House willing to make another attempt (House and Urban 1973). That year, he 
invited Graser to participate in an exploratory surgery with local anesthetic. He 
tried a temporary implant in several locations to determine its ideal placement and 
number of wires. He also tested the máximum intensity of the stimulus, noting, 
“When an intensity of two volts was introduced, he [Graser] responded by jump- 
ing, indicating that the pain threshold had been reached.” 28 This new round of 
human experimentation met with more public criticism, most notably from Dr. 
Nelson Kiang of the Massachusetts Institute of Technology, who called the work 
“premature” (Kiang 1973, 512). 

In 1970 Graser received a permanent “button” implant behind his right ear. 
Having five input wires, it divided the signal into sepárate bandwidths. Over the 
next four years Graser spent thousands of hours with House and Urban in the lab, 
testing different circuits, carrier waves, and modulation schemes. House remarked 
that “many of these devices took months to construct and proved worthless after a 
few hours of testing” (House and Urban 1973, 505). Graser, he attested, was a 
genuine collaborator (see figure 13.2): 

As an ex-ham radio operator he was a sophisticated listener and could fully 
describe the different signáis presented as stimuli . . . He is an ambitious and 
goal-oriented individual who is tenacious in his desire to maximize the use he can 
derive from the implant . . . In addition, he is both articúlate and an excellent 
observen C.G. has been able to communicate to us much valuable information 
concerning his experience with the implant and has made thoughtful suggestions 
concerning ways of upgrading the system . 29 

Graser, on the other hand, was eager to have greater jurisdiction over the exper- 
imental process. In his own log, he recorded the following: “This electronic cochlea 
testing does bother you. It’s like having someone say, ‘Have a seat in the electric- 
chair while I fiddle with Controls.’ It may not hurt, but it is sometimes frightening 
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Figure 13.2 House (left) and Graser (right), c. 1974. Photograph courtesy of the 
John Q. Adams Center for the History of Otolaryngology, Head and 

Neck Surgery, Alexandria, Va. 


in its intensity and your inability to control it. Even to begin with, you don’t know 
how much you can take.” 30 He was finally allowed to field-test a portable implant in 
1972 even though House and Urban were not certain it was safe. There was a risk of 
infection; moreover, fluorescent lights, electrical wires, and highway radar traps 
caused interference. 

Graser was aware that this prototype was never expected to provide “normal” 
hearing. In his field notes he commented, “You would probably describe my current 
progress as changing from profoundly deaf to just hard-of-hearing, but difficulty 
hearing and comprehending is in a completely different league from silence. For 
instance, tonight I can finally hear the bell that indicates that I am at the right hand 
margin, as I type this letter.” In many respects, this early implant was a radically dif- 
ferent and limited kind of ear: “I used to be a radio operator, and sometimes I would 
get a distant signal that I couldn’t really hear. It sounded dim and garbled. That’s the 
way this sounds. It’s definitely an electronic sound.” Yet in some ways, Graser ’s 
acoustic sensitivity exceeded that of his wife: “I will be startled by a brief exhaust 
sound of a car going by outside the house. Barbara doesn’t even hear the car.” 

Graser began tinkering with his processor at home and recorded his findings 
about battery lifespan, microphone type and placement, and signal modulation. 
“I am constantly experimenting with the device,” he wrote to Dr. House (House 
and Urban 1973, 510). He painstakingly documented the transformations of his 
domestic soundscape: “Walking on the floor in the house is not just padding sounds, 
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but is more of a hammering with an echo for each step. Sound i s too sputtery.” 
“You can hear water run into the sink. Almost too noisy.” His first transmitter had 
múltiple diais for control over features such as carrier amplitude and frequency, 
modulation, microphone sensitivity, and high-filter cutoff. Graser recommended 
against standardizing the design: “I would have as many manual Controls on the 
instrument as possible so that each patient could customize sound, much as if they 
were using a short wave receiver.” 31 His field tests led to a number of concrete and 
lasting improvements: carrier waves that received less interference from environ- 
mental electricity; microphones worn at the head rather than in the pocket (where 
they picked up too much “clothing noise”); continued miniaturization, so the pro- 
cessor could also be worn behind the ear. 32 Like other lead users, Graser also con- 
tributed physically (if not deliberately) to surgical procedure; the determinaron of 
suitable implant materials; protocols for minimizing the destruction of hair cells; 
and evidence for the ability of the inner ear to withstand electrical stimulation over 
a period of many years. 

In 1972 an electrode short damaged Graser’s skin, causing House to work 
toward a fully implanted stimulator to replace the “button” model. As part of this 
restructuring, House decided to convert to a single-channel implant. Graser’s “skin 
began to show evidence of retracting and reacting to the external button. Some 
leaking and shorting of the electrodes was observed. House secured all wires into a 
single bundle to prevent loss of the whole system. This event made it urgent that the 
electronics be converted into a single electrode system.” 33 Graser received a fully 
implanted induction coil (as opposed to “hard wires”) at his left ear in 1972, and in 
1974, as a result of “secretions and debris around the button causing the wires to 
fail,” his right ear was finally reimplanted. 34 

Blume regards William House as critical in establishing an international clini- 
cal reputation for cochlear implants. Before FDA approval of the single-channel 
implant (and before the FDA became responsible for medical devices), a very few 
volunteers participated in clinical experiments — each potentially having a large 
impact on design. Throughout the early 1970S Charles Graser served as House’s 
primary evidence in publications and presentations. 35 In those years Nelson Kiang 
rebuked House for assuming that his triáis with Graser were valid: “Enthusiastic 
testimoniáis from patients cannot take the place of objective measures of perfor- 
mance capabilities” (Kiang 1973, 512). Nevertheless, Blume contends that House’s 
“early successes with a simple implant aroused the interest of clinicians in many 
countries, as well as of potential manufacturers” (Blume 2010, 173). Furthermore, 
as Hannaway points out, when the National Institute of Neurological and 
Communicative Disorders and Stroke (NINCDS) “sought the first objective evalu- 
ation of cochlear implantees in the United States” in 1975, it sent just thirteen indi- 
viduáis with single-channel devices — Charles Graser among them — to the lab of 
Robert Bilger at the University of Pittsburgh. “Bilger was a known skeptic about 
cochlear implants, and, as the result of the investigation, he was converted to being 
a modest supporter. So were the others who had previously opposed cochlear 
implants” (Hannaway 1996, 23). 
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Not all of Graseras recommendations survived in commercial versions of the 
technology. In the most significant enduring design shift, House began to build 
portable models with a single control dial, a eliminating the necessity for frequent 
fine adjustments” — in other words, eliminating the personal control and auditory 
customization that Graser so appreciated . 36 This black-boxing of the technology 
compounded disability, implying a lack of technical facility among users, as well as 
an obligatory dependence upon physicians and medical engineers. Moreover, 
although Graser strongly preferred receiving signáis at both ears, bilateral cochlear 
implants would largely be refused by insurance companies for the next thirty-five 
years. Here, the drive toward normalization crossed purposes with the economics 
of health care (see figure 13.3)- 


Signal Processing 


House eventually partnered with 3M Corporation to produce a commercial device; 
in 1984 their single-channel model was the first cochlear implant to receive FDA 
approval. By that time, investigators in Australia, France, Austria, England, and 
elsewhere in the United States had taken up cochlear implants, mostly with an 
interest in marketing a multichannel device. At the University of Melbourne, 
Graeme Clark (whose father was deaf) began work on a multichannel implant 
in the 1970S; he claims that his subjects solicited him after a 1977 news brief in 
the Melbourne Herald. Clark arranged for an ethics committee to oversee his 
research, following the 1975 Declaration of Helsinki amendment regarding 
biomedical experiments on humans. Working with a single patient implanted in 
1979, Clark concentrated on the problem of signal processing: what information to 
extract from the sonic environment for transmission down a limited number of 
channels; the rates at which to stimulate the auditory nerve; the placement of 
electrodes: 

In theory, the coded signáis should simúlate in the auditory nerve fibers the 
temporal and spatial patterns of action potentials seen when sounds excite the 
normal cochlea . . . Therefore, it is necessary to know if speech processors can 
present speech as coded signáis with a more limited number of stimulus channels 
and still adequately simúlate the physiology. Alternatively, the speech processors 
should extract only the essential speech information that can be processed by the 
auditory nervous system via a relatively small number of stimulus channels. 

(Clark 1992, 95) 

Clark tested a number of processors on this first patient, who had been deafened as 
an adult due to a head injury, and he evaluated each strategy according to the man’s 
self-reported ability to perceive speech. Clark ultimately decided, in phonetics 
terms, to build a processor that extracted fundamental frequency and one to two 
speech formants. 
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After implanting a second person in 1979, Clark determined to proceed with a 
commercial device, necessitating a series of clinical triáis on deafened adults. With 
no government funding, Clark could afford to give free implants to only his first six 
patients. Subsequent “volunteers” paid a mínimum of $10,000 to particípate in 


Figure 13.3 Early single-channel implant with fully implanted electrode. (The superfluous 
pointing finger objectifies and diminishes the user of this device.) 

Photograph courtesy of the John Q. Adams Center for the History of 
Otolaryngology, Head and Neck Surgery, Alexandria, Va. 
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experiments that their health insurance would not cover. In 1985 the Australian 
(Nucleus) twenty-two-channel implant was approved by the FDA. 

In the United States, researchers at UCSF, MIT, the Research Triangle Institute, 
and the University of Utah — often in collaboration with one another — generally 
departed from the feature extraction approach in the 1980S. Rather, they attempted 
to build multichannel speech processors, both analog and digital, that simply fil- 
tered and compressed incoming sound waves, with a focus on the frequencies of 
speech. Although the implants developed during this time varied in their numbers 
of electrodes, they shared many other hardware features. Users of experimental 
implants often tested múltiple speech processors (eventually instantiated as soft- 
ware), and processing schemes in turn shared emerging psychoacoustic data. In 
Rebuilt: How Becoming Part Computer Made Me More Human , a “scientific autobi- 
ography” of obtaining an implant, Michael Chorost acknowledges his own debts to 
this second generation of user-researchers, who helped establish how to increase 
the number of electrodes; how to prevent crosstalk between those channels; and 
how best to dissect a signal. He specifically recognizes Michael Pierschalla, who 
participated in the clinical triáis of Ineraid — the third implant to receive FDA 
approval — at the Massachusetts Eye and Ear Infirmary and the Research Triangle 
Institute. a His feedback integrally shaped most if not all of the software that is 
widely used today, not just in my own implant but in those of other manufacturers 
as well . . . Just as the DNA of all my ancestors lives on in me, a bit of his sensibility 
lived on in my software. My own body bore the stamp of his intelligence and 
generosity” (Chorost 2005, 106). 37 Signal processing continúes to be the major 
site of cochlear implant development today, and it develops with a nonlinear 
incrementalism. 

Pierschalla had lost his hearing suddenly at the age of twenty, an effect of 
Cogan syndrome. He studied at the Rochester Institute of Technology and 
became an artist and a noted furniture craftsman. While living in Cambridge, 
Massachusetts, he read a news article about local cochlear implant research. A friend 
wrote on his behalf to Dr. Donald Eddington of the Massachusetts Eye and Ear 
Infirmary; by June 1985 Pierschalla had obtained an experimental six-electrode 
Ineraid implant. 

Pierschalla lamented his estrangement from the hearing and the sign language 
worlds even as he embraced his implant: 

I carne to my hearing loss late, too late to abandon my hearing ways, too oíd to 
change the culture of my nativity, and too resistant to losing the remnants of my 
identity. Thus, I tried to tell them I was like a man without a home. Looked on as 
deaf by those who hear, and as Hearing by the native Deaf, and so for years have 
existed between those worlds, knowing much of both, but embraced and 
understood by neither, longing for the seamless brotherhood of either one, but 
incapable of owning that due to circumstances . . . Finally, one day in the 
laboratory, a man turned on a switch and my head lit up with sounds and alien 
noise and before long I went home and for the first time in years, I called my 
momma on the phone and we both cried like babies, and things have been very 
different after that . 38 
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Pierschalla participated in laboratory as well as field tests of this device: indepen- 
dently trying out different microphones from Radio Shack; documenting the 
recharge times of his batteries; recording environmental sounds (subways, busy 
streets) to play back in the laboratory. While watching a film, he would estímate 
speech intelligibility for off-screen voices and under conditions of theater noise, 
múltiple talkers, and “background” music. Lecturing at the Museum of Fine Arts in 
1992, he reflected on his interactions with physicians and scientists: <c As a craftsman 
and an armchair philosopher, I’ve sometimes wondered where the tool ends and 
the art begins. In the same way, when Ym in the lab with these scientists, we at our 
opposite ends of the same wire, I wonder where the observer ends and the subject 
begins. And I don’t have an answer.” 39 On múltiple occasions, he co-presented 
cochlear implant findings with otolaryngologists at medical conferences. 

Shortly before Pierschalla’s death in 2002, Science writer Víctor Chase inter- 
viewed him over the phone. Chase noted the man’s transition fr om patient to expert: 

Pierschalla’s knowledge and success was widely known within the cochlear 
implant community. As a result, in 1995 Med-El, a cochlear implant manufac- 
turer based in Innsbruck, Austria, asked him to establish its North American 
office in Boston, as an organization of one. He accepted, and shortly thereafter he 
relocated the company’s office to Research Triangle Park, North Carolina. He 
then worked on obtaining FDA approval for sale of the Med-El implant in the 
United States. (Chase 2006, 26) 

In 1995 Pierschalla served as a member of the second NIH Consensus Conference 
on Cochlear Implants in Adults and Children. The panel concluded that the 
population of implant users was steadily growing, that the device seemed to be 
successful in children, and that “cochlear implants should be made available to 
adults with severe hearing loss,” as well as those who were “profoundly” deaf 
(Hannaway 1996, 53). Along with this medical consensus, one by-product of the 
many years of advocacy by late-deafened implant users was thus a renewed medical 
encroachment upon the Deaf world. 

Chorost worried that his own perception — and, in turn, his perspectives — 
might be unduly influenced by the anatomy and predilections of the test subjects 
whose “scripts” he had inherited. He quickly learned that the prosthesis “doesn’t 
change your valúes in the slightest” — although it does embody an “epistemology,” 
complete with certain biases (Michael Chorost blog, comment posted March 2, 
2009). The drive of early investigators toward speech processing, for instance, is 
inscribed into his implant. Thus, music sounds “fíat and dull . . . ñutes and soprano 
saxophones sounded as though someone had clapped pillows over them . . . oboes 
and violins had become groans” (Chorost 2005). In an article written for Wired 
magazine ( t£ My Bionic Quest for Boléro ”) Chorost discusses his own participation 
in numerous software upgrades and engineering experiments in order to expand 
the frequency range of his sixteen channels. He further suggests that by flying 
“to labs around the country with [his] own agenda — to try out their software spe- 
cifically with music,” he’s “gotten them to focus on music sooner than they might 
have otherwise” (Michael Chorost blog, comment posted January 5, 2006). 




336 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


Philip Loizou, an electrical engineer at the University of Texas, Dallas, notes 
that speech-processing strategies also contain biases against environmental sounds, 
noisy environments (often due to unrealistic laboratory testing situations), and 
tonal languages (which convey information through pitch rather than simply for- 
mants/timbre) (Loizou 2006). 40 Furthermore, the endemic electromagnetic inter- 
ference between implants and wireless or cordless phones indicates the primacy of 
face-to-face communication for implant design — not to mention the long-standing 
disregard for hearing aid and implant users within the telecommunications indus- 
try. 41 Finally, Chorost reminds that the design of cochlear implants is of course 
constrained by economics. Although his initial device allowed him to switch 
between two speech processing strategies, one analog and one digital, an upgrade 
reduced him to just one, “because audiologists simply don’t have the time to fit all 
of their patients with two different kinds of software.” (Chorost 2005, 189). 


Deaf Futurism 


Narratives that depict the history of cochlear implants as a binary conflict 
between Deaf culture and normative biomedicine additionally obscure the radical 
aspirations of a minority of deaf implant users. In a 1999 Hastings Center Report , 
G. Q. Maguire and Ellen McGee contended that cochlear implants had “set the 
stage” for neuroenhancement: 

Three stages in the introduction of such devices can be delineated. The earliest 
adopters will be those with a disability who seek a more powerful prosthetic 
device. The next step represents the movement from therapy to enhancement. 

One of the first groups of nondisabled “volunteers” will probably be in the 
professional military, where the use of implanted computing and communication 
devices with new interfaces to weapons, information and Communications could 
be life-saving. The third group of users will probably be people involved in 
information-intensive businesses who will use the technology to develop an 
expanded information transfer capability. (Maguire and McGee 1999, 9) 

This linear evolutionary trajectory assigns deaf people to the primitive State 
and suggests a structural need for individuáis with disabilities to serve as test beds 
for new technologies. It ignores the possibility that any person might desire 
“enhancement” rather than therapy from a prosthetic device. 42 

A small subset of “deaf futurists” has always maintained an investment in tech- 
nical innovation and posthumanism. Unlike the futurist art movements of the early 
twentieth century — which tended to reject the poor, the nontechnological, and the 
unfashionable, along with the “traditional” — minority futurisms counter histories 
of exclusión in Science and engineering. 43 

Often unrecognized, deaf scientists and technophiles number among the 
leading theorists of implant futurity. Internet founder and Google vice president 
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Vinton Cerf, who wears two hearing aids, previously asserted an enduring relation- 
ship between deafness and Computer networking. “In creating the Internet with my 
colleagues,” he has stated, a in part I wanted to help people with hearing loss as well 
as other communication difficulties” (Better Hearing Institute 2007). 44 Years later, 
for the Time magazine special report on “visions of the 2ist century,” Cerf predicted 
web-enabled implants: “The speech processor used today in cochlear implants for 
the hearing impaired could easily be connected to the Internet; listening to Internet 
radio could soon be a direct computer-to-brain experience!” (Cerf 2000, 103). 

Cerf s wife, Sigrid, whom he met at a hearing aid center, acquired an implant in 
1996. In a short story written for The Little Magazine, he lists the ways “the little deaf 
girl” had grown up and begun experimenting with technology: 

Her aggressive approach to hearing led her to obtain and in some cases invent 
assistive methods to augment the basic speech processor/implant combination. 

She had patch cords made to connect her speech processor to the armrest of 
airplane seats so she could hear the movie sound track directly rather than 
through audio headphones. She obtained books on tape and listened to them too 
through a patch cable connecting the sound output of the tape recorder directly 
into the auxiliary audio input of her speech processor. She had numerous 
microphones made on wires ranging from six feet to sixty feet in length so she 
could put a microphone cióse to the speaker at lectures or at the dinner table. She 
obtained FM transmitters with microphones built in, so she could put the 
transmitter on a lectern and then, using an FM receiver, listen to the speaker — 
again using direct audio input into the speech processor. She obtained infrared 
receivers to pick up the sound track in movie theatres which are equipped to 
transmit this signal to receivers for the hearing impaired. 

She had patch cords made to plug into her mobile telephone and used 
magnetic telephone coils with patch cords for use with ordinary telephones. 

The list goes on and on. (Cerf n.d.) 

Sigrid herself has intimated that she would one day like to link to the Internet via 
her implant (Hamilton 2002). 

Similarly, Mike Chorost comes out as a cyborg in his autobiography (which 
was endorsed by Manfred Clynes). Part human and part machine, he describes 
himself as at once limited and endowed with new capacities. His implant is an object 
lesson about the imperfections of the technology, the ways normalization can be 
desired alongside enhancement, and the fact that “posthumanism” can coexist with 
impairment. He marvels about patching into his Walkman and “hearing music that 
never actually exists as sound. This could be evidence of a profound transformation 
in how human beings take in information from the world around them.” He con- 
tinúes, reflecting on the inevitable domestication of his device, “Or, it could just be 
a cozy domestic scene: a cyborg and his cat” (Chorost 2005, 58). 45 

In a piece for The Futurist magazine, Chorost muses, “While my friends’ ears 
will inevitably decline with age, mine will only get better.” He warns of the potential 
costs to privacy and security of networked implants: “Neural devices such as 
cochlear implants are computers and can be wirelessly networked. People’s bodies 
and brains could become visible to the global network in ways we can only dimly 
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imagine now, much the way people could barely imagine the impact of Computer 
networking back in the age of Apple lis” (Chorost 2006, 68). 46 At the same time, 
Chorost criticizes the escalating barriers to users’ control. Present-day implants 
allow users to adjust volume, microphone sensitivity, and “prívate tones” (which 
give cues about the functioning of the device), as well as to choose among three or 
four “customized” programs based on highly generalized listening environments 
(i.e. “quiet,” “noisy,” and “focus”). Paren ts and teachers can use a remóte control 
to set these programs at a distance for children wearing implants. For adults and 
children alike, choice of speech-processing software is limited and software settings 
(i.e. amount of stimulation per electrode) must be set by a clinician. 

Responding to a blog entry by Cory Doctorow titled “Deaf hacker rewrites 
implant-firmware so he can enjoy music again,” Chorost counsels: 

I didn’t actually rewrite the software. To do that Td need a degree in electrical 
engineering, an insider’s knowledge of the code, and an understanding of how 
electricity interacts with body tissues. If I broke into Advanced Bionics in the 
dead of night, fired up the computers, hooked up the interface gadgetry to my 
processor, and started changing code at will, I could cook my inner ear or electro- 
cute myself. Much more likely, my processor would crash and I’d slink out in 
total humiliation. I’d be deaf until I went back to my audiologist to have the 
software re-uploaded . . . Cory is right that the more control a person can have 
over his or her prosthesis, the better. I would love it if Advanced Bionics built a 
gadget that let me change some of my parameters on my own. While I have a lot 
of control over the device through my audiologist, it’s control that takes weeks 
and months to unfold and explore. My audiologist and I experiment with various 
parameters during fitting sessions, and it always frustrates me that we only have a 
few hours in each session to try to find the best valúes for 20 or more variables. 

I would love to have the freedom of playing with them on my own. That would 
considerably accelerate the process. But it is possible in principie to hack one’s 
own implant. (Michael Chorost blog, comment posted January 5, 2006) 

The complexity of electronic objects such as cochlear implants means that most 
users will not be able to modify them at will. Moreover, the drive to standardize 
medical devices means that most examples of “deaf futurism” exist in theory rather 
than being instantiated in the material-semiotic register of signáis. 


CONCLUSION 


A comprehension of the politics of circuitry and software enables, on its own, a 
small measure of perceptual control. Along these lines, media theorist Vilém Flusser 
once argued that the very constructedness of his hearing aids provided an advantage 
over “normal” hearing. Having fled to Brazil in 1940, Flusser managed a radio 
and transistor factory before turning to the philosophy of communication. 
In a manuscript titled “Hoerapparate,” he suggested that “the hearing aid is 




DO SIGNALS HAVE POLITICS? 


339 


freedom” — and he wondered £< why there has not been more philosophical writing 
on the subject.” Deafness plus a hearing aid meant that he had “ear-lids”; he could 
choose when to be immersed in the world’s noises and voices. More important, he 
felt that his hearing aid helped him to see and hear better : on the one hand, to be 
aware of the manufacture and obstructions present in all communication; on the 
other hand, to understand the programming behind his own auditory perception: 

If you are listening to the world, you will notice that sounds are “instrumen- 
tized.” Not a white buzzing that comes to the ears, but an orchestrated swinging. 

A programmed noise. Therefore it must be supposed that between you and the 
world there is a sound-sieve turned on, a hearing aid. The unpleasant, even 
unacceptable thing about this apparatus is that one cannot see it. Therefore one 
cannot know who programmed it. If its program is coming from the world out 
there or from you yourselves, for instance through the way your ear is built. Even 
good oíd Kant puzzled his head on this subject. My own hearing aid is visible. 

One knows who programmed it, a Japanese company. And this finally is an 
advantage I have in comparison to you. I can, better than you are able to, see 
through my hearing aid. And therefore hear better than you. 47 

In much the same terms, Chorost recognizes the epistemological valué of the cochlear 
implant. After an extended analysis of Donna Haraway’s “Cyborg Manifestó,” he 
weighs in against ££ the blithe assumption that one’s sensory organs deliver a truthful 
representation of the universe...people with normal ears are not off the epistemo- 
logical hook, because their “software” was written haphazardly by millions of years 
of evolution and has no greater claim to reality” (Chorost 2005, 147). 48 

The history and consequences of communication engineering, particularly at 
the level of electrical signáis and machinic filters, must be considered more broadly 
within sound studies. Electroacoustic devices — with sleek casing, miniaturized cir- 
cuitry, and confidential corporate histories — increasingly resist “seeing- through.” Yet 
in the case of cochlear implants, the desires of early users, the conflicting demands 
of mainstream medicine and economics, and the mediated features of electrical 
listening — in other words, the politics attendant upon communication — can be 
found embedded in the design of electroacoustic objects. 


NOTES 


1 At BBN, Swets notes, researchers “thought of computers as symbol processors — for 
example, theorem provers and pattern recognizers — rather than as number crunchers . . . 
these psychologists would lend what they knew about human perception, thinking, language, 
and motor control to the design of computers that would augment or supplant human 
behavior in, for example, librarles, process control, and robotics” (Swets 2005, i5> 18). Paul 
Edwards traces the Harvard PAL transition “from wartime work on human-machine 
integration to postwar concerns with information theory to the Computer as a metaphor for 
the human mind.” The PAL scientists investigated problems such as communication between 
humans and complex military machines (e.g., submarines, airplanes); the transmission of 
speech in battlefield noise; the jamming of enemy Communications; training in listening and 
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articulation for military personal. These scientists Cí made no distinction between the technol- 
ogy of hardware and the technology of language and listening” (Edwards 1996, 210, 214). 

2 Approval followed for children over the age of two in 1990 and for twelve-month- 
olds in 2000. 

3 These users are mostly wealthy individuáis or citizens of countries with socialized 
medicine, as the procedure costs approximately $30,000 (Spelman 2006). On poverty and 
the global distribution of implants, see Zeng (2007). The NIDCD website maintains current 
statistics on cochlear implant use: http://www.nidcd.nih.gov/health/hearing/coch.asp. 

4 “It is an unequal battle since the two accounts differ greatly in their authority. The 
medical understanding draws on and reflects the authority of Science and the promise of 
medicine, both of which have become fundamental components of modern industrial 
culture. The deaf perspective draws on and reflects the experience of a traditionally 
marginalised and stigmatized group” (Blume 1999, 1265-66). 

5 Brenda Jo Brueggemann has recently suggested that the deaf/Deaf distinction has 
been ineffective and might even be unrealistic. In this chapter, however, I follow these 
conventions of capitalization when referring to the audiological (“deaf’) versus linguistic 
minority (“Deaf’) definitions of deafness (Brueggemann 2009, 9-15). 

6 In this book, Blume ineludes a short section on “adult implantees” and the potential 
for their experiences to complicate the Deaf/hearing binary, as well as any reductive 
audiological definition of deafness: “The boundary between deafness and hearing is a 
complex región, marked by valúes, memories, histories, and commitments, and to be 
crisscrossed in many ways.” See pages 163-70. 

7 To the contrary, Blume insists that patients are “typically not seen as competent 
interlocutors in the innovation process” (Blume 1997, 32). 

8 Trevor Pinch and Nelly Oudshoorn elabórate as follows: 

[I]n the design phase technologists anticípate the interests, skills, motives, and 
behavior of future users. Subsequently, these representations of users become 
materialized into the design of the new product. As a result, technologies contain a 
script (or scenario): they attribute and delegate specific competencies, actions, and 
responsibilities to users and technological artifaets. (Oudshoorn and Pinch 2003, 9) 

See the chapters by Oudshoorn and Blume and Rose regarding the significance of clini- 
cal triáis or research and development to user configuration. 

9 Helen Nissenbaum asserts that “computer and information systems can embody 
valúes,” and these must be considered in tándem with their “social effeets” (Nissenbaum 
2001, 120). 

10 Blume surmises that the initial disinterest of the Deaf community led implant 
manufacturers to direct marketing pressure toward the hearing parents of deaf children. 
The marketing focus has since expanded to inelude late-deafened seniors, by far the largest 
population with severe hearing loss (Blume 2010, 116-117, 144). 

11 Here he draws on the work of Rayna Rapp and Faye Ginsburg (Blume 2010, 162). 

12 Bernard Frayssend to Dr. House, June 11, 1978, “Djourno/Eyriés Publications” 
Folder, Cochlear Implants, 1961-1995 Collection, John Q. Adams Center. Drawing on this 
collection, Marc Eisen (also an otolaryngologist) has written two short articles on the early 
French implant (Eisen 2003, 2006). 

13 Phillip Seitz, “Interview with André Djourno and Daniéle Kayser 1/12/1994,” 
612-OH-11, Cochlear Implants, 1961-1995 Collection, John Q. Adams Center. See also Seitz 
(2002). 
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14 As Eisen narrates, Djourno and Eyriés ran into each other at the medical school 
morgue, where Eyriés was looking for a potential nerve transplant for Monsieur G. 
Djourno convinced him to try an implant instead (Eisen 2006, 3). 

15 Eisen explains that Eyriés would have preferred to contract an engineering firm to 
manufacture the implant (Eisen 2003, 503). Blume offers a different interpretation of the 
conclusión to this experiment; he says the patient “decided that he had had enough: it was 
not worth the investment of so much time and emotion” (Blume 2010, 32). 

16 Phillip Seitz, “Interview with André Djourno and Daniéle Kayser 1/12/1994,” 
612-OH-11, Cochlear Implants, 1961-1995 Collection, John Q. Adams Center. 

17 Ibid. Djourno also claimed that the implant allowed her to “hear her own voice for 
the first time in eight years” (Djourno and Vallancien 1958, 555). 

18 Phillip Seitz, “Interview with André Djourno and Daniéle Kayser 1/12/1994,” 
612-OH-11, Cochlear Implants, 1961-1995 Collection, John Q. Adams Center. 

19 William House, “Cochlear Implants: The Development of an Idea,” February 1976 
(typescript), p. 9, 921-HSB-2, Cochlear Implants, 1961-1995 Collection, John Q. Adams 
Center. 

20 For the story of Howard’s own unconventional approach to medicine, see Hyman 
(1990). 

21 The year before, James had designed similar equipment to record the sounds 
produced by the auditory nerve of a person with tinnitus. At the time he realized, 

“Well, gee, if we can do this, we ought to be able to reverse the process and make the 
person hear that’s deaf.” Even though the biophysics of hearing had not been clearly 
determined, James felt that “as an engineer you always look for a practical way of making 
something work, whether it’s the absolute truth or not.” Phillip Seitz, “Interview with 
John and James Doyle (8/22/93),” 612-OH-9, Cochlear Implants 1961-1995 Collection, 

John Q. Adams Center. 

22 A woman, S.S., was implanted at the same time, but her electrode had to be 
removed almost immediately due to poor insulation. 

23 In 1973 E.K. volunteered for another round of experiments, this time with new 
materials. 

24 John Doyle claimed his slides had been stolen from a meeting of the American 
Medical Association (Schmeck 1962). The Doyle brothers continued with this research for 
several years until lack of funds forced them to abandon the project (Doyle et al. 1963). 

25 “California Electronics Firm Readies ‘Artificial Ear’ Implant,” SpaceAge News 3(18) 
(1961): 1; “Electronic Firm Restores Hearing with Transistorized System in Ear,” SpaceAge 
News 3(21) (1961): 1. Radio -Electronics carried many of the earliest reports of the French 
implant. For instance, “News Briefs: Electronic Ears,” Radio -Electronics 29 (December 
1958): 6. 

26 These first experiments by Simmons did not result in take-home implants. 

27 Credit for the first functional portable implant remains controversial within the 
otolaryngology community. Although House is often recognized for developing the first 
successful take-home implant technology, Michelson’s experimental devices, described in 
a 1971 publication, were cióse competitors. 

28 William House, “Cochlear Implant: Hope for the Nerve-Deafened Person: A 
Decade of Progress” (typescript, n.d.), p. 4, 921-HSB-2, Cochlear Implants, 1961-1995 
Collection, John Q. Adams Center. 

29 Ibid., 27, 29. 

30 Charles Graser Papers, 921-HSG, Cochlear Implants, 1961-1995 Collection, John Q. 
Adams Center. For a similar sentiment two decades later, see Chorost 2005, 163. 
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31 For all quotes in this section, see Charles Graser Papers, 921-HSG, Cochlear 
Implants, 1961-1995 Collection, John Q. Adams Center. 

32 Today, some processors are again available as body-worn devices, marketed for 
their relative invisibility. 

33 William House, “Cochlear Implants: The Development of an Idea,” February 1976 
(typescript), p. 28, 921-HSB-2, Cochlear Implants, 1961-1995 Collection, John Q. Adams 
Center. House and Urban had, moreover, not been able to elimínate crosstalk between 
electrodes. 

34 They found that the silver wire had not corroded after four years. William House, 
“Cochlear Implants: The Development of an Idea,” February 1976 (typescript), p. 36, 
921-HSB-2, Cochlear Implants, 1961-1995 Collection, John Q. Adams Center. 

35 C.R., who was implanted at the same time, had late-stage syphilis and moved away 
from the Los Angeles area. 

36 William House, “Cochlear Implant: Hope for the Nerve-Deafened Person: A 
Decade of Progress” (typescript), p. 8, 921-HSB-2, Cochlear Implants, 1961-1995 
Collection, John Q. Adams Center. 

37 In Cambridge, the cochlear implant program was a joint effort of the Infirmary, 
Harvard Medical School, and M.I.T., coordinated by Dr. Donald Eddington. Another 
physician described Pierschalla as “more than a test subject ... he was a valued colleague, 
an integral contributor, and a friend” (Chorost 2005, 106). 

38 Michael Pierschalla, 1990, typescript in the author’s possession. 

39 Ibid., 4. 

40 Biases against tonal languages within signal processing have long been of concern 
in the hearing aid industry. See as one example McCullough, Tu, and Lew (1993). 

41 See Goggin (2006, ch. 5). “Politics” also reside in the medical determinaron of 
implant candidacy: psychological criteria, as well as willingness to maintain a relationship 
with the clinic for implant maintenance, are often taken into consideration. As a 

last example, experimental devices were designed with more or less flexibility for 
upgrades. 

42 For a more thorough discussion of the ways the same technology can be used for 
either treatment or enhancement, see Parens (1998, 1-2). 

43 A different iteration of this concept can be found in Afrofuturism, defined by 
sociologist of Science Alondra Nelson as instances of “sci-fi imagery, futurist themes, and 
technological innovation in the African diaspora” (Nelson 2002, 9). The heterogeneity of 
the deaf community means that examples of “deaf futurism” vastly exceed cochlear 
implants. Members of Deaf culture who reject cochlear implants, for instance, may 
embrace a range of “deaftechs” from wireless relay Services to videophones to motion 
capture to ASL vlogs. More important, Brenda Jo Brueggemann calis attention to the fact 
that the number of Gallaudet students with cochlear implants “has virtually doubled itself 
each year” in recent times; the relationship between Deaf culture and this technology is not 
inevitably one of opposition (Brueggemann 2009, 16-17). 

44 In 1978 Cerf reported on “The Electronic Mailbox: A New Communication Tool 
for the Hearing Impaired” to the readers of American Annals ofthe Deaf. At that time, the 
U.S. Department of Health, Education, and Welfare (in collaboration with Bolt, Beranek, 
and Newman) was testing email on the deaf population of Framingham, Massachusetts 
(Cerf 1978, 771). 

45 Chorost’s current implant model allows him to use standard earbuds and skip the 
patch cord step. Mike Chorost, in discussion with the author, June 17, 2009. 
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4 6 On this note, Chorost’s forthcoming book examines the emerging technology of 
optogenetics. On controlling genetically altered neurons with light see Chorost (2007). 

47 Vilém Flusser, “Hoerapparate” [Hearing Aids], trans. Silvia Wagnermeier. Vilém 
Flusser Archive, Berlin University of Arts. Thanks to Siegfried Zielinski for alerting me to 
this document and Silvia Wagnermeier for providing the translation. 

48 Chorost disagrees with Haraway’s expansive definition of “cyborg,” preferring to 
apply the term only to cases of cybernetic technology — that which “exerts control over the 
body” (Chorost 2005, 41). 
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SECTION V 


EDITING 
SOUND: THE 
DESIGN STUDIO 




CHAPTER 14 


SOUND AND 
PLAYER IMMERSION 
IN DIGITAL GAMES 


MARK GRIMSHAW 


Introduction 


The perceptions arrived at from different senses often combine either to provide 
further information about the environment or to confirm the information already 
provided by one sense. This is particularly the case with sight and hearing, which 
may, in some cases, be one perceptual phenomenon. A ventriloquist’s dummy or a 
movie screen have no sound-producing capabilities, yet sound from elsewhere is 
perceptually located on the moving mouth or parts of the film image. Even when 
the sensations of sight and sound are temporally separated, they may still combine 
to make one perceptual phenomenon; a gunshot might be heard some time after a 
muzzle flash is seen, but the firing is perceived as one event. However, notwith- 
standing the perceptual effects of combined sensory phenomena, removing all sen- 
sory information but that derived from my hearing and combining it with general 
experience and knowledge, I would still be able to describe in great part the envi- 
ronments in which I may be situated. Even when the other senses are available, 
I often rely solely on sound to inform me. For example, from the sounds of vehicles 
passing unseen on the road outside, I can, to varying degrees of accuracy, ascertain 
the direction they are traveling in, their speed, the type of vehicle, and the density 
of traffic. From this latter deduction, I can further deduce the time of day, for 
example. Furthermore, as a single taste can spawn a monumental novel, unseen 
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sound sources can provoke images; founded upon experience, specific denotations 
derived from a visualization of the sound-producing object or activity or, typically 
more varied, connotations that may be communal in their meaning or quite indi- 
vidual. Conversely, seen but unheard objects and activities can be mentally sounded; 
the sight of a coin spinning ever more slowly on a table top will lead to a mental 
sound object closely synchronized to the object’s material, form, and action. 
At the very least, assuming both senses are functioning normally, if visión and 
hearing are not perceived as one perceptual event, there is a strong correlation 
between them. 

This is a chapter about sound in digital games — games played on gaming con- 
soles and home computers — and how the design of sound for such a médium 
contributes to player immersion in the game world, especially in worlds designed to 
be immersive. This statement encapsulates an assumption: that sound in such digi- 
tal game worlds contributes to the immersion of the player or, at the very least, that 
the game designer uses sound in an attempt to facilitate such immersion. Before 
this assumption can be tested, a number of questions raised by that assumption 
must be answered: What is sound in the digital game? What is the relationship between 
sound and image? Is there a difference in the use or perception of sound in the real 
world and in virtual worlds? and What is meant by immersion? To ¿Ilústrate the 
answers to these questions, sound use in first-person perspective games, of which 
the most notable subgenre is the first-person shooter (FPS) game, will be used. 
Such games have an immersive premise; on-screen, the player is typically presented 
with a pair of arms receding perspectivally into a representation of a three-dimen- 
sional visual space; the implication is that the pixellated arms are the player’s own 
arms extending into and interacting with the game world. To parallel this, the player 
is positioned in a field of sound as a first-person auditor, and that field contains 
sounds that change their location as the player’s character moves and have a dynam- 
ically processed reverberation (particularly in modern games) to approximate the 
effects of the materials and spaces in the game world. There are many game genres 
beyond those using the first-person perspective, but the latter offer a greater range 
of immersive possibilities as afforded by sound; indeed, not all digital games can be 
described as immersive at all. However, the study of immersion in those games that 
do invite it is important from an STS perspective that seeks to understand the effects 
of game sound experience on technology and design and vice versa. Immersion is 
actively pursued in the design of some games, yet its attainment is little understood, 
and still less understood are the long-term implications of increasing periods of 
immersion in virtual worlds. 

Throughout the chapter I appear to be suggesting that real world and virtual 
world are distinct contexts, polar opposites, and ne’er the twain shall meet. While 
this serves the purpose of brute analysis, the reality is somewhat less clear cut, and I 
beg the reader’s indulgence in my use of the artifice of polarity. It is a means to an 
end, and that end, as will be clear at the end of the chapter, is to demónstrate that, 
while differences do exist, such absolute distinctions are a nonsense where sound 
use and player immersion are concerned. 
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Sound in Digital Games 


My hands clutch the machine gun nervously as I survey the scene before me. I am 
amid the ruins of a French village. A single bell tolls in the church tower behind me 
while pigeons flap and coo around it. An aircraft roars overhead, and I have just 
passed a sign creaking disconsolately on one rusty hinge and offering vin rouge. 
Somewhere, a record of Edith Piaf crackles through the empty streets, stuck on one 
lingering phrase. A tank burns in the distance, and the sound of muffled explosions 
and gunfire comes from the left and ahead behind a pile of rubble. The static of 
radio occasionally updates me on my comrades’ status while I edge forward cau- 
tiously. Footsteps gradually impinge on my consciousness, and I whirl around, 
expecting to engage with the enemy. Nothing. All of a sudden I see a grenade are 
gracefully through the air toward me, landing with a metallic, clattering bounce off 
the wall in front of me. As I jump back, the grenade explodes, and I take a hit in my 
leg, causing me to involuntarily grunt in pain and limpingly seek cover. Edith sings 
of regret to the accompaniment of the lonely bell. A head appears over the wall, and 
I instinctively empty a clip of my machine gun at it; blood spatters my face, and 
I hear a scream that satisfyingly mingles with the musical tinkle of empty, ejected 
shells. Silence. I stop to bandage my wound. 

Analyses of sound and sound use in digital games — of which the scenario just 
now sketched is typical — usually take as their starting point principies defined in 
cinema theory. As an example, the notion of diegetic and nondiegetic sound has 
been imported from film criticism (Curtiss 1992; Chion 1994, for example) to game 
sound theory by a number of authors (such as Grimshaw 2008a; Jorgensen 2006). 
For games, following definitions in cinema, a simple assertion is that diegetic sound 
is the sound that derives from the internal logic of the game world and that nondi- 
egetic sound is all other game sound such as the musical score and menú interface 
sounds — this chapter deais with diegetic sound as just defined and does not con- 
cern itself with nondiegetic sound. Similarly, the notion of acousmatic sound has 
been borrowed following its development from electroacoustic composition by 
film theorists such as Chion (1994). In the case of digital games, as for film, acous- 
matic sound is off-screen sound — a problematic definition for some (Metz [1985] 
disputes the on-screen/off-screen distinction) while others argüe that technologies 
such as surround sound blur distinctions between on-screen and off-screen [Chion 
1994, 129-31] ) — but having the benefit of denoting sound that derives from a source 
not seen on the screen. 

No semantic classification is ever black and white, and there are several gray 
areas in the digital games application of terminology such as diegetic and acousmatic 
that are best illustrated by a brief discussion of the mechanics and technology of 
game sound. In the first instance, the soundscape of a digital game is different each 
time the game is played. Film soundscapes may change with censorship, dubbing, 
reproduction equipment, and interaction with the external acoustic environment. 
Essentially, though, the sound track on the médium made available to distributors 
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and viewers is fixed at the point of production. This is not the case with digital 
games, which, as a highly interactive activity, come not with a fixed sound track on 
the distribution médium but with discrete audio objects used as required during 
gameplay to create a compound soundscape that is unique at each playing. Early 
games used real-time sound synthesis, and games in the future may well return to 
synthesis as a means of sound creation. Current games (as of 2009) with few excep- 
tions make use of audio samples that take advantage of the large capacity of modern 
digital-storage media. These audio samples, be they intended as diegetic or nondi- 
egetic, are sounded at the player’s command or by the game engine in response to 
the current game State (which itself is dependent upon the actions of the player or, 
in multiplayer games, players). The individual, unprocessed audio samples for any 
particular game may remain the same, 1 but the resultant soundscape is different at 
each playing of the game because the player’s actions in creating that soundscape 
are different at each playing. Where the intended soundscape of film is fixed at the 
point of production, digital game soundscapes are created anew at the point of 
reproduction. 

Given these differences then, it is no surprise that, while similarities may exist 
between film sound and game sound, there are also significant differences that war- 
rant care in the transfer of sound theories and concepts from film to digital games. 
Defining a film’s musical score as nondiegetic is fraught with difficulty not least 
because of an ambiguity that is often exploited by the director. For example, the 
musical score, apparently nondiegetic, 2 that unmasks itself as diegetic through a 
panning shot that reveáis a radio or an orchestra (Count Basie’s Orchestra anach- 
ronistically in the desert in Blazing Saddles [Brooks 1974] is an extreme case in 
point). Furthermore, where the film’s musical score typically folio ws the action on- 
screen, perhaps heightening the emotional impact or aiding in the intended inter- 
pretaron of the scene (thus, in some cases it may be interpreted as having diegetic 
aspects to it), the technological génesis and the interactive nature of digital games 
make the boundary between diegetic and nondiegetic sound even more porous. All 
digital games are predicated upon the actions of the player, and there are many 
instances in which sections of the game’s musical score are played only in response 
to certain player actions. Rez (Mizuguchi 2001) is an extreme example where the 
musical score derives almost entirely from the player’s actions. Notwithstanding 
these conceptual debates, the design of many digital games (especially FPS games) 
does recognize a distinction between diegetic and nondiegetic sound through the 
inclusión of sepárate volume Controls for sound effects and music. The “hunter and 
the hunted” premise of such games requires that the player be particularly attentive 
to game-world sounds — headphones aid further in excluding sounds external to 
the game — and thus it pays to turn the music off. 

Acousmatic sound is similarly problematic as a concept particularly when 
applied to digital game sound. As Stockburger (2003) States, the player has kines- 
thetic control over the display (or not) of sound sources. Whereas in film, such 
visualization and acousmatization of sound is controlled by the director, in digital 
games, particularly FPS and other first-person perspective games, it is under the 
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control, for the most part, of the player. Where an acousmatic sound is heard from 
behind the position of the player’s character, the player can maneuver the character 
within the game world toward the source of the sound so that that source becomes 
displayed on-screen. 


The Relationship between Sound 

AND ImAGE IN THE GAME 


I have already hinted at a strong relationship between sound and image not only in 
the scenario I sketched at the start of this chapter but also, first, in the suggestion 
that sound and image might be perceived as one event and, second, through the use 
of terms such as visualization and seen sound source. Upon hearing a sound (where 
the source remains unseen), a person will use experience to attempt to create a 
mental image of the sound source; a form of visualization exploited by directors 
and sound designers in the horror genres of both films and digital games. Gaver 
(1993) goes further in suggesting that sounds inherently betray the broad defini- 
tions of their sound sources; large objects tend to produce lower-pitched sounds 
than smaller objects, and particular events, such as bouncing, have a characteristic 
sound pattern. Such aural signatures, Gaver proposes, might be the basis for the 
synthesis of caricature sounds (by analogy with visual caricature as opposed to 
representative photography), a notion to which I return later. 

The game’s sound sources displayed on the screen are, as in film, not the 
actual sound sources. They are apparent sound sources, and all sounds and their 
sources are, quite literally, offscreen. This is because of the displacement of image- 
reproduction hardware compared to sound-reproduction hardware. Sound is not 
reproduced by means of the screen but is reproduced by headphones or loudspeak- 
ers that are often quite distant from the images displayed on the screen. 3 Disregarding 
the effect of the reflection of direct sound, the physical distance between sound 
(re)production and (apparent) sound source is one of the major differences between 
sound-image phenomena in the natural world and sound-image phenomena in the 
virtual worlds of digital games. As I write in my room, the hum of my fridge really 
does emanate from the fridge I see before me, and the scratch of my pen arises 
directly from the moving point of contact between pen and paper. Nevertheless, the 
sensations that arise from disjunct sound and image sources can be perceived as 
one event. This perceptual co-location of sound and apparent sound source has 
been described as synchresis or synchrony in cinema (Anderson 1996; Chion 1994) 
but is also known in acoustics and psychoacoustics as the audio-visual proximity or 
ventriloquism effect. In part, the conjunction has to do with experience (it becomes 
obvious that the music heard from loudspeakers is actually the music played by the 
orchestra on-screen); in part it has to do with the cióse synchronization between 
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sound events and moving images (if an object on-screen starts to move, then it may 
well be “producing” the sound that started simultaneously); and in part it has to do 
with correct mental visualization of sounds (as in the orchestra mentioned earlier 
but also on the presumption that fundamental parameters of sounds — frequency, 
intensity, timbre — betray the general physical outlines and material properties of 
their sources). It is this perceptual cohesión between the results of physically dispa- 
rate sound- and image-reproduction technologies that plays a major role in player 
immersion in the game world. Paradoxically, it is the distortion of reality inherent 
in the sepárate reproduction systems that is a contributing factor to allowing the 
player to respond to the game world as if it were real. 


Game Technology — Real World, 

Virtual World 


The differences and similarities between real-world sounds and sounds as mediated 
through digital games are best understood from the vantage points of both what is 
technically feasible and what the game designer’s intentions are. If we assume that, 
in reality, we are immersed in a range of environments, then one can also assume 
that many digital games attempt to simúlate, if not emulate, this immersion, 
particularly in first-person-perspective games. 

Sitting at my desk as I write this, for the purposes of analysis I can assume I am 
immersed in the environment I perceive around me. Furthermore, I can identify 
several hierarchies and classes of environment; the environment of my room is part 
of the larger environment of my house, which itself is part of the larger suburban 
environment, and distinct visual and sonic environments (indeed, tactile and olfac- 
tory environments) can also be identified. From my particular perceptual location, 
these environments occupy different shared or overlapping spaces. Furthermore, 
positioning myself and others within those environments enables me to describe 
them as ecologies; in sound terms, the acoustic ecologies and acoustic communities 
described by Schafer (1994) and Truax (2001). Assuming a human is immersed in 
such environments, 4 then the game designer (of FPS games, for example) attempts 
to create similar ecologies by immersing the player in the environments of the game 
world. There are, however, several problems in establishing this immersion, and 
they all relate to the technology used in digital games. 

Some of the differences and similarities between sound in the real world and 
sound in digital games have already been discussed as regards their technical origin 
(e.g., the workings of synchresis in combining distinct sensory events in different 
modalities into one perceptual event). Sound emanates from the vibrating object 
that is the cause of the sound; digital games, however, must propose apparent sound 
sources on the screen physically sepárate from the real vibrating sound sources 
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(e.g., loudspeakers or headphones). Similarities usually arise from the simple fact 
that the reproduction of sonic and visual artifacts in the game world takes place in 
the real world; whether game-world sound or real-world sound, sound still reaches 
our ears through vibrating air, where it is processed by the same sensory and per- 
ceptual organs and faculties and interpreted using experience from both worlds. 
Likewise, the sound fields of FPS games are designed to be all-encompassing and 
omnidirectional 5 as opposed to the restricted visual field; FPS games provide a more 
highly restricted field of view than in reality, and, barring the use of virtual-reality 
headsets or goggles, this field is presented to the player as merely part of the larger, 
real visual field. 

Limitations of digital-game technology impose other differences than a separa- 
tion between real-sound source and apparent-sound source. In an FPS game, the 
player is presented with a first-person perspective on the visual game world. 
Similarly, the player is positioned within the game’s sound field as a first-person 
auditor (Grimshaw 2008a, 83). This is analogous to the way a human hears the real 
world. No digital game, however, other than experimental games using head- or 
eye-tracking devices, will play and process sound according to the position of the 
player’s head or body. In real-world environments, dynamic sound sources move 
around the listener, but listeners, too, can alter the relative position, the intensity, 
and the timbral properties of sound through their own head or body movement. In 
digital games, the player must move the character in a certain direction or roíate the 
character’s entire body in order to effect a change in the sound properties listed 
earlier. In effect, the entire visual and acoustic environments of the game world are 
rotated or otherwise acoustically processed as per the sophistication of the game 
engine being used, relative to the first-person point of audition (which is the same 
as the first-person point of perspective). This point, regardless of whether the player 
is wearing headphones or is the focal point of loudspeakers, is located at the player’s 
head. Visually and aurally, the player is at the center of the game world, and, in this, 
there is no difference between viewing and listening to the real world. In FPS digital 
games, though, while images seem to move around the character s point of view, 
sounds always move about the player s head. 

In reality, the number and variety of sounds are, for all practical purposes infi- 
nite. Various drops of water have subtle sound differences depending upon the 
physical properties of the drop, the position of the listener, the environment, and 
the growing size of the pool they form. Digital games are not yet able to sonically 
compete with nature. 6 The reason for this is twofold: a limit to the storage capacity 
of digital-game equipment and a limit to the processing power of the equipment. 
Sound design, for FPS games particularly, is primarily an atomistic process rather 
than a holistic one. The sound designer has no way of knowing, for example, how 
many footstep sounds, at what speed, and on what surface will need to be heard 
during a game and so, ideally, should provide an infinite number of audio samples 
in order to match the potential variety of such sounds. The number of footsteps, 
speed of movement, and surface on which to move are choices within the gift of the 
player, 7 and so, individual audio samples, tailored to any possible combination, 
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should be provided to be triggered when required. However, the storage capacity of 
the game’s distribution médium limits the number of audio samples that can be 
provided. Furthermore, audio samples will be buffered in random access memory 
(for rapid access), which not only has other requirements made of it but also has a 
much-reduced capacity compared to hard drives and DVDs. There is also both a 
hardware and a software limit to the number of audio samples that can be played 
simultaneously. Complicating matters further is the fact that there might not only 
be a requirement for a variety of footsteps in a typical FPS game but also a require- 
ment for a range of gunshots, voices, ambient noises, thuds, bangs, explosions, 
screams, and so on such that, in addition to the game software itself and the image 
files for screen display, no current game-storage médium has the capacity needed. 
The number of audio samples delivered with the game is, therefore, reduced to 
such an extent that no game yet can truly emulate the sound of a sonically rich, 
real-world environment. 

Game designers attempt to ameliorate these limitations by a variety of meth- 
ods. For example, a single audio sample might comprise several sounds, particu- 
larly if the sample consists of ambient sounds and is intended to be looped. 8 Audio 
compression 9 might be used to reduce the storage requirement of each sample 
without overly compromising the sound quality. Variety in repetitive sound classes 
(such as footsteps) is mimicked to some extent by the random use of a small set of 
similar samples. Increasingly, audio samples will be processed upon playback; for 
example, a reverberation will be applied that approximates the acoustic properties 
of the spaces depicted on-screen relative to the first-person point of audition. In 
game consoles that use dedicated audio digital-signal processor (DSP) chips, the 
real-time processing achieved can be quite complex but will always require control- 
ling from the central processor unit (CPU). This processor must also be used for 
other tasks within the game, many of which will be prioritized. In attempting to 
limit the amount of processing required of the CPU for reverberation or instruc- 
tions to DSP chips, the result can only be a rough approximation of the dynamic 
reverberation artifacts a human will usually hear upon moving through a typical 
real-world space. 

Audio samples might be synthesized, or they might be recordings of real-world 
objects that are then synchronized with representations of those objects within the 
game world, whether those objects are displayed on-screen or not. They might be 
those recordings processed and used in a potentially schizophonic manner (Schafer 
1994, 89-91), applied to a context different from that when originally recorded. For 
example, an authentic recording of a SPAS shotgun might be used every time the 
player fires that model in an FPS game. The type of realism that FPS games in par- 
ticular attempt, where game objects’ parameters and motions are modeled on real- 
world physical properties and behaviors, makes use of authentic audio recordings 
in an attempt to create a sense of realism. The use of authentic audio samples 
recorded in the real world and played back in the virtual world raises several impor- 
tant questions, the answers to which will help understand immersion in FPS games. 
For instance, how far should one pursue authenticity in digital games? Too much 
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realism — a realism of theme that comprises “plausibility of characterization, cir- 
cumstance and action” (Córner 1992, 100) combined with authentic audio samples, 
photorealistic imagery, and accurate physical modeling — potentially results in a 
simulation or even an emulation rather than a game. Realism in the FPS game, 
pursued to its logical conclusión (the death of players rather than characters), would 
not be permitted under any current legislature; thus, rather than realism per se, 
codes of realism are used that are often based on convention within the genre or are 
imported as schemas (Douglas and Hargadon 2000) from our experiences else- 
where. A second question is based upon the notion of schizophonia; namely, 
are the sounds heard in the game dissociated from their cause and context? Prima 
facie, the answer would be yes; sound has been plucked from a real-world sound 
source and uprooted from its environmental and causal context and placed in a 
virtual world to act as the sonic surrogate to mute pixels. Certainly, in realism FPS 
games, there would seem to be an irreconcilable tensión between the desire for real- 
ism and authenticity of sound and Schafer’s claim that such a situation is schizo- 
phonic, producing a synthetic soundscape rather than an authentic, causally real 
soundscape. 

The answer to such questions of authenticity and schizophonia may be found 
by examining the significance and level of sound indexicality in the context of the 
game world (rather than the context of the recording). Film theorists have long 
pointed out that cinematic sound is not only schizophonic but, very often, is also 
quite unrelated to the apparent sound source shown on-screen. In a reference to the 
synchresis already discussed here, Lastra (2000, 147, 207), writing about tin-sheet 
thunder and coconut-shell horse hooves, States that “fidelity to source is not a 
property of film sound, but an effect of synchronization” and views the stockpiling 
of analog and digital audio samples for use in film audio dubbing as a production 
method to be used for the construction of a representation of reality. This bears a 
similarity to Chion’s (1994, 108) suggestion that the use of such sound conventions, 
no matter how divorced from reality, creates the “impression of realism [and such 
sounds become] our reference for reality itself.” The notions of new references for 
reality and of constructed representations of reality, codes of realism, may equally 
be applied to digital games. As already stated, however, games are highly interac- 
tive, and the resultant soundscape for games is not preconstructed and supplied on 
the distribution médium but, rather, is constructed in real time according to the 
actions of the player (or players in a multiplayer game). The game engine, there- 
fore, acts as a sonification engine by converting the nonaudio data, the player’s 
actions, and the player-derived game State into sound (Grimshaw 2008b, 119-20). 
In this case, the authentic audio samples in realism FPS games superimpose the 
indexicality of the real-world context with an indexicality derived from the sonifi- 
cation processes of the game. 10 The recording of a SPAS shotgun is heard not as 
the original recorded sound source in its real-world context but becomes, through 
the synchresis of player action, game image and audio samples, the sound of the 
player firing the game’s shotgun. The construction of another reality in FPS games 
using sounds that are indexical to the player’s actions in the game world rather than 
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being strongly indexical to the external world is a strong motivating factor in the 
facilitation of player immersion in the game. 


Immersion 


I have previously proposed that humans are immersed in their real-world environ- 
ments and that this immersion is what Computer game designers attempt to replí- 
cate, especially in first-person-perspective game worlds such as those found in FPS 
games. I have carefully elided the question of what exactly is meant by immersion 
but, having now laid the groundwork for understanding some of the processes at 
work in FPS game sound, I can no longer avoid attempting to answer that ques- 
tion — it is, after all, the focus of this chapter. The short answer to the question what 
is immersion ? is that no one knows. Grau (2003, 13) may describe the phenomenon 
as a mix of “diminishing critical distance . . . and increasing emotional involve- 
ment,” but this does not explain how immersion occurs, its processes, ñor does it 
account for the possibility of physical immersion. Am I not physically immersed in 
the sound environments of both reality and of digital games at the a center of audi- 
tory space” (Ihde 2007, 207)? I have lost track of the number of conference speakers 
I have heard who have owned up to their ignorance (before proceeding to claim 
how such and such a technology will aid in creating more immersive digital-game 
environments), and I, too, guiltily hold up my hand. Immersion has become 
the holy grail of first-person-perspective game design, the touchstone by which 
the quality of such games is measured. Yet, many questions remain unresolved; 
theoretical answers lack experimental data that would help formúlate the proce- 
dural rules required to create truly immersive digital-game worlds. This is not to 
say that no one has attempted to define immersion in digital games both theoreti- 
cally and/or experimentally, but such work, especially the latter form, is at an early 
stage and thin on the ground. This section briefly discusses such work as prepara- 
tion for a discussion on how sound is used for facilitating player immersion in 
FPS games. 

Early discussion describing psychological States similar in many ways to the 
descriptions of immersion listed later do exist. The philosophy and ideal of absorp- 
tion held by eighteenth-century French art critic Diderot has been summed up as 
an obliteration of a beholder’s presence in front of the painting and the transporta- 
tionoPYhebeholder’s physical presence [to] within the painting [wherebybeholder 
and painting become] a closed and self-sufficient System” (Fried 1980, 131-32). 11 
Immersion in virtual environments, 12 whether the virtual environments of digital 
games or of other applications, has been explained by a variety of related terminol- 
ogy. Thus, concepts such a s presence, being there , involvement> engagement , and flow 
are used in part, whole, or combination to describe States or processes of immer- 
sion. Discussing the technology of virtual environments, presence is used to describe 
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either “a sense of being able to touch and manipúlate a virtual object” or, more 
germane to this chapter, “a sense of being and acting inside a virtual place” (Reiner 
and Hecht 2009, 183). In the latter sense, presence is the “direct result of perception 
rather than sensation [and] the mental constructions that people build from stimuli 
are more important than the stimuli themselves” (Fencott 1999). Perception rather 
than sensation is at the root of Slater’s (2002) Gestalt-derived theory of presence in 
virtual environments, in which the brain chooses from among a set of hypotheses 
in order to determine where one feels present — in the real world or in the virtual 
world. A switch between hypotheses is termed a “break in presence” by Slater. 
Brenton and others (2005), in a discussion that uses theories of presence to derive a 
perceptual basis for the uncanny valley theory, 13 suggest that we instead superim- 
pose these hypotheses and that our sense of presence relates to the dominant 
hypothesis. 14 

A number of scholars have defined frameworks or rule sets for player immer- 
sion in digital games that prescribe the conditions for immersion, where immersion 
relates to player engagement or involvement in the game. Kearney and Pivec (2007) 
statethatimmersionprovidesthemotivation — which they relate toCsíkszentmihályi’s 
concept of flow — for the player to repeatedly engage with the game, and they use a 
study of eye movement and blink rate to determine when a State of immersion is 
being experienced; theoretically, the less movement and the lower the blink rate, 
the more immersive the game experience is. Their requirements for an immersive 
State in the game inelude an emotional involvement in the game, an altered sense 
of time, and a lowered awareness of the player’s surroundings during gameplay. 
The latter State, in particular, may be related to Brenton et al.’s theory of the 
superimposition of presence hypotheses rather than Slater’s switch of hypotheses, 
which requires a lack of awareness of the surroundings once the immersive switch 
has taken place; the player is still required to use the game’s hardware to engage 
with the game since such hardware is not a part of the game world and is still 
able to respond to real-world alarm signáis, for example. Calleja (2007) appro- 
priates Goffman’s metaphor of the frame in order to construct an involvement 
model to explain immersion in digital games. In order to foster what Calleja terms 
incorporation in the game (a term he prefers to immersion ), the player fluidly 
switches between six frames of involvement: tactical; performative; affective; shared; 
narrative; and spatial. The process of internalizing these frames is related to flow 
but, Calleja explains, cannot be equated with it as flow is a description of activity 
rather than a description of the environment in which such activity takes place. 
I return to the concept of incorporation later when I discuss the acoustic ecology of 
FPS games. 

Following Fencott’s analysis of presence, McMahan (2003, 75-76) suggests that 
immersion is facilitated in digital games through the design of surities and surprises 
into the game world. The former are cues (Fencott’s term) in which elements in the 
game confirm and conform to the player’s expectations; in FPS games, these might 
be the ability to navigate (the character) around the apparently three-dimensional 
spaces of the world or an appropriate use of paraspaces (Parkes and Thrift 1980) 
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in which, for example, there is a set of weaponry that is contemporaneous with the 
game’s premise. Surprises are either an aid to navigation around the game world or 
provoke an action from the player. 15 These are an indication of the interactivity of 
digital games, in which the player is able to engage with the game spaces and objects; 
they might be visual signs that indicate routes through the game level or guns that 
allow the player to deal harm to enemy characters. For McMahan, a defining 
structure of immersion is realism: a consistent social realism (similar to Corneas 
plausibility of theme) and a perceptual realism, the use of perspective in the visual 
design of the game world, for example. She further States that, for immersion to 
occur, the actions of the player cc must have a non-trivial impact on the environ- 
ment” and, following from the recognition of active interactivity as an element of 
immersion, that “immersion is not . . . wholly dependent on audio- or photo- 
realism” (McMahan 2003, 68-69). Hidden within this last statement is the implicit 
suggestion that immersion in game worlds is dependent upon audio- and photo- 
realism and that anything less risks a loss of immersion, a break in presence. Here, 
realism, in the case of sound, is assumed to be represented by the authenticity of 
audio samples in the game; as I have already noted in the context of cinema, a vari- 
ety of scholars have stated that such authenticity is not a prerequisite for a sense of 
realism. In the context of sound in digital games, I return to this point later. 

Building upon Pine and Gilmore’s (1999) work on experience, Ermi and Mayrá 
(2005) claim that “immersion [in digital games] means becoming physically or 
virtually a part of the experience itself.” Furthermore, they distinguish between 
three forms or States of immersion: sensory; challenge based; and imaginative 
immersion. Again, I return to these ideas later, particuiarly the sense that Calleja’s 
incorporation is related to the notion of sensory immersion and of being physically 
a part of the game’s sound world, when I discuss the role of sound in digital- 
game immersion. The idea of sensory immersion, as opposed to imaginative or 
challenge-based immersion, is fundamental to Carr’s (2006, 69) category of percep- 
tual immersion, where the player’s senses are monopolized by the game world. 16 
A final statement on immersion comes courtesy of Garcia (2006, 23), who suggests 
that “in the most immersing environments reminders of the structural level of the 
game are gone.” To bring the discussion back to Diderot, the unity of beholder and 
painting in a “self-sufficient system” or the unity of player and game world in an 
autopoietic, self-organizing system (Grimshaw 2008c) “is the strongest magic of 
art” (Diderot, quoted and translated in Fried 1980, 13o). 17 

Three themes can be abstracted from the preceding overview of immersion 
theories: presence; active engagement; and codes of realism. They are not mutually 
exclusive, and indeed there is the strong suggestion that all three must be achieved 
to some degree to provide the conditions for immersion in digital games. However, 
common to the three themes is the interplay, perhaps even tensión, between sensa- 
tion and perception. Is presence, for example, a matter of being physically within a 
concrete reality, the phenomenological realm of the senses, or is it a matter of a 
mental construct, a Platonic realm of thoughts and ideas? Is immersion effected 
only when all of the elements of the digital game have the prerequisite levels of 
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verisimilitude and/or veridicality, the “semblance of truth” required for Coleridge’s 
(1817) “willing suspensión of disbelief,” and, if so, what is that level? Can someone 
with no experience of the virtual environments of digital games and, therefore, 
without the weight of prior knowledge and expectation, become immersed in those 
worlds? The following section delves into the role of FPS game sound in immersion 
in an attempt to answer just such questions. 


The Role of Sound in FPS 
Game Immersion 


Here I focus on the relationship between digital game sound and the player and 
how such sound is designed to achieve a perception of immersion and, indeed, 
whether such immersion is achieved. It is important to note that immersive game 
worlds do not attempt this immersion through sound alone; the designer’s arsenal 
contains a battery of sensory and interactive game technology for just such a pur- 
pose. Haptic feedback in FPS games is at an early experimental stage. It is more 
advanced (but still primitive) in other digital game genres such as racing games on 
gaming consoles, particularly where the handheld controller, for example, might 
vibrate in response to crashes or uneven road surfaces. In the context of FPS games, 
therefore, I deal solely with image as it relates to sound. 

In addition to hearing, visión forms a large part of the experience of FPS games, 
particularly where the display on-screen is designed to lead to player immersion in 
the game world. As already noted, this immersion is initiated (in FPS games) with 
the game character being seen only in part. A pair of arms clutching a weapon 
recedes from the bottom of the screen into the space of the game world — a virtual 
prosthetic extensión of the player, bridging reality and virtuality. The Identification 
of the player with the character is further strengthened by the control the player has 
over these arms, the ability to move the character around the game world, and the 
player’s ability to use that character to interact with objects and other characters in 
the environment. As the character responds to the player’s control, it is the player 
who perceives that they are navigating in and interacting with the game world, and, 
in FPS games, responses to misfortune are always in the nature of “I got killed” 
rather than Cí my in-game character was destroyed.” 

As previously mentioned, sound can be used in conjunction with image to 
make sense of the real-world environments in which we are situated; it can also be 
used in the absence of image by making greater use of experience and imagination. 
In the FPS scenario sketched earlier in the chapter, I used sound in both ways 
to engage with the game. Even though the vin rouge sign itself does not move, its 
creaking sound and the direction it arrives from allow me to imagine that it does, 
especially as the image itself is now out of view. The images depicting the ruined 
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village may make use of perspective in which there is a scaling of size toward a van- 
ishing point but in which the entities themselves are fíat, two-dimensional objects 
displayed on a fíat, two-dimensional screen. The distant muffled explosions, the 
locational properties of the sounds I hear, and their reverberant characteristics 
create of that village a three-dimensional world that, in its sound environment at 
least, approaches somewhat the level of detail of sound environments of the real 
world. 

Much of the sound I hear in the FPS game is acousmatic sound. Sound from the 
game world surrounds me, unlimited in its directionality; my visual window onto 
that world is severely restricted, as in reality, and even more so in that the Computer 
screen and its images form only a small part of what I can see. Acousmatic sound in 
the FPS game allows me, with the benefit of an experience that derives both from 
reality and from previous playings of the game, to visualize sound. Footsteps behind 
me indícate the presence of someone or something in both reality and virtuality; in 
this game’s world, they further connote a potential threat. Distant sounds of battle 
combined with radio messages allow me to imagine situations that team members 
might be experiencing and to follow the progress of the game beyond what the screen 
offers. This, then, is the sonic world of the FPS game, which works in conjunction 
with image and imagination to engage the player in the game world. 

I previously noted three themes that are apparent in scholarly discussion of 
immersion: presence, active engagement, and codes of realism. The presumption is 
that all three, in varying degrees and combinations, are required to define and/or 
create immersion. It might be supposed that presence in digital games (i.e., a sense 
of being and acting inside the game world) is analogous to immersion — Brown and 
Cairns (2004, 3) equate what they cali “total immersion” with presence — but this 
definition does not take account of the requirement to have McMahan’s nontrivial 
effect on that world for immersion to occur. Acting inside a virtual world is not a 
strong enough term to describe the player’s ability to effect change in the world, 
indeed, to create that world. Limiting our discussion to sound, the soundscapes of 
FPS games may well require a “discerning Subject” to be present (Bóhme 2000, 15), 
but all of the sounds in that soundscape are not merely sounded by the game engine’s 
sonification processes in response to the player’s very presence in the game world 
but are also changed in direct response to the player’s actions. This is because of the 
player’s engagement with the interactive affordance offered by the technology of 
the game: As the player navigates through the FPS game world and plays the game, 
the soundscape changes accordingly. Movement creates the sound of footsteps, 
firing the sound of gunfire, and environmental or ambient sounds fade in and out 
as the player moves through the game’s spaces. The player’s imagination is engaged 
by acousmatic sounds, and the sounds of other players’ characters in a multiplayer 
game indícate that those players are also present in and engaged with the same 
world; players’ acoustic environments dynamically overlap and part company. 

What level of realism or authenticity of audio samples is required for immersion? 
Jorgensen (2006, 13), writing in the context of digital games, States that “realistic audio 
samples . . . will make the audio world more immersive.” If, by “realistic,” she means 
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authentic, we have already seen that her assertion is not necessarily the case. What 
is the authentic sound of the zombies in Left 4 Dead (Valve Corporation 2008)? 
What is the authentic sound of the BFG 18 in Quake III Arena (id Software 1999) and 
the various imaginary monsters in Half-Life 2 (Valve Software 2004)? Would the 
authentic, realistic absence of sound in a game played out in the vacuum of space 
make it more of an immersive experience? 

Shilling, Zyda, and Wardynski (2002), using the realism FPS game America’s 
Army (MOVES Institute 2002), measured players for electrodermal and heart-rate 
responses in order to test emotional arousal in the presence of the game’s authentic 
sounds. While not an experiment to measure immersion per se, the authors assumed 
that “emotional arousal has a positive impact on [the] sense of immersion in virtual 
environments” and, following the results of their experiment, observed that immer- 
sion is crucially enhanced by precise synchronization of sound and the action dis- 
played on the screen; this is objective (but not conclusive) evidence, in the context 
of digital games, for the effect of synchresis. 

Even in the presence of nonauthentic sounds, however, game players experience 
immersion. This was the case in a psychophysiological experiment on diegetic sound 
and nondiegetic music in the FPS game Half-Life 2 (Grimshaw, Lindley, and Nacke 
2008). This game has a range of authentic audio samples of footsteps on a variety of 
surfaces, for example, and has reasonably accurate, real-time modeling of reverbera- 
ron according to the volumes and materials of the immediate game space; it also 
ineludes a variety of decidedly nonauthentic vocalizations ff om the game’s imaginary 
creatures. 19 Players 5 subjective responses gathered after gameplay categorically indi- 
cated perceptions of immersion in the game world (and these perceptions increased 
with the addition of sound), while the results of electromyography (EMG) and gal- 
vanic skin response (GSR) measurements taken during gameplay indicated heightened 
arousal when game sound was heard (as opposed to when it was muted). 20 

Darley (2000, 16-17) defines realism, in the context of digital media such as 
games and computer-generated films, as the degree of resemblance to real-world 
objeets; while not discussing audio realism, photography is given as the yardstick 
for images. Photo-realist images, he claims, are indexical to their real-world coun- 
terparts, and, by this definition, authentic audio samples are indexical to real-world 
sounds and, by extensión, to their sound sources. Following this logic, where digital 
visual realism derives solely from the indexicality of the photograph, if used in a 
digital game, audio samples are phono-realist regardless of their use and context. 
Yet realism may also derive from theme and action as discussed earlier, and, 
although authentic, phono-realist audio samples are widely used in FPS games, a 
sense of realism also derives from the way they are used in the game. Cinematic 
sound design has the mantra “see a sound, hear a sound”; this is extensively used in 
FPS games, but the dictum in this case can be expanded to “do a sound, hear a 
sound.” The sonification of the player's in-game actions is the realism required, in 
part, for immersion in the game world rather than, necessarily, the use of authentic 
audio samples. In simulating the processes of acoustic environments of the real 
world within virtual worlds, game designers provide not only indexical, real-world 
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sounds, sometimes in addition to more fantastical sounds, but also a simulation of 
sound génesis and behavior in the game world that is similar to that found in the 
real world. In this, the active relationship between the player and sound may be 
likened to the acoustic ecologies found in nature. Immersion is to be arrived at not 
only through the inclusión of authentic sound objects from the real world but also 
from the use and context of such sounds and other sounds. This is an immersion 
based primarily on contextual realism rather than object realism, verisimilitude of 
action rather than authenticity of sample. 

The game’s soundscape, its acoustic environment, and the player together form 
the acoustic ecology of the FPS game. Like Truax’s (2001, 66) concept of the acous- 
tic community “in which acoustic information plays a pervasive role in the lives of 
inhabitants,” the FPS game’s acoustic ecology has an effect upon the actions of the 
player (or players in a multiplayer game). Footsteps approaching from behind 
might cause the player to turn to meet a potential threat or to escape a particularly 
vicious in-game character (Grimshaw 2009). Psychophysiological data provide 
more detailed evidence of the effect on the player’s physiology and emotions while 
in that acoustic environment. However, the concept of an acoustic ecology goes 
further than that of an acoustic community in suggesting that the human is a fun- 
damental component of that ecology who not only responds to acoustic informa- 
tion but also generates acoustic information. This generation of acoustic information 
and, indeed, of the acoustic environment itself is evidence of the immersion of the 
player (or players) in the game world and of the sonically concrete reality of that 
game world. Calleja’s six frames of involvement to explain his notion of the incor- 
poration of the player into the game world is missing a seventh frame. While “incor- 
poration” and the performative frame hint at it, Ermi and Máyrá’s concept of 
sensory immersion provides the physicality lacking in Calleja’s model. Gene 
Youngblood (1970, 206) wrote that “The notion of ‘reality 5 will be utterly and finally 
obscured when we reach that point [of generating] totally convincing reality within 
the information processing systems . . . We’re entering a Mythic age of electronic 
realities that exist only on a metaphysical plañe. 55 Although he was discussing the 
future and the potential of visual computing hardware, it is not stretching the imag- 
ination too far to suggest that the immersion of the player in the game world 
through the incorporation of the player as part of the game’s acoustic ecology leads 
to a reality that is different from that of the real world — one where the player, 
through presence and a realist active engagement with the acoustic environment, is 
truly immersed in a mythic electronic reality, that of the game world. 


CONCLUSIONS 


Throughout this chapter I have used a variety of polarizing sets of terminology. 
With regard to one particular set, from a superficial reading of the chapter, the 
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reader might depart with the supposition that there is a real world and a virtual 
world and that immersion is simply a matter of stepping from one space into the 
other. This would be wrong. Although I have used such polarity to clarify my argu- 
ments, I have also dropped hints throughout the chapter pointing to a different 
State of affairs; the immersed player still utilizes real-world objects to interface with 
the game world and is attentive to real-world alarms, for instance. In “reality,” real 
world and virtual world are two poles of a continuum, along which the player is 
able to be transported, importing conventions and experiences and expropriating 
and interpreting meaning from either world. In the context of digital games, the 
waters of this apparent divide are about to become murkier still. 

William Whittington, describing sound and animation production practices at 
Pixar Animation Studio (this volume), discusses the future of such sound design as 
encompassing the synthesis of sound generated in response to the image and 
extends this to the possibilities for real-time sound synthesis in digital games 
(a possibility discussed elsewhere, such as in Grimshaw 2009). Sounds would be 
synthesized according to player action, game context, and game architecture. Let us 
take this idea a step further. As demonstrated throughout this chapter, there is 
plenty of evidence as to the effect of sound upon player affect and emotion. Turn 
this around: What about the effect of affect and emotion upon sound? Primitive 
consumer headsets that measure EMG and electroencephalography (EEG) are 
already available to interface with home computers and gaming systems. The work 
my colleagues and I are involved in asks the question, Can we use EMG and EEG 
output from the player to process or synthesize sound in a game world such that that 
sound itself alters the player s affect and emotion in a predictable manner? Can, for 
example, the player be made more frightened in a survival horror game through the 
specific alteration or creation of sound if the game engine “senses” the player is not 
frightened enough? 21 Can the stress, pitch, and rhythm patterns of a nonplayer 
character’s speech be changed in response to the player’s psychophysiological State? 
Can the sound of a monster be ripped out of the hidden recesses of the player’s ter- 
rors? Such a topic is one for another chapter (and continuing empirical work), so I 
conclude by asking how, should such a visión of all-encompassing, real-time sonic 
biofeedback be reached, would we then define immersion? What are the implica- 
tions for the already muddy distinctions between real world and virtual world? 
Would — and could — immersion be attained through a precise and calibrated 
manipulation of the player’s psychophysiological State, and would there be distinc- 
tions such as real world/virtual world, or would the gamer, instead, inhabit a blurred 
world immersed neither here in reality ñor there in virtuality but in a new form of 
space somewhere in between? 


NOTES 


1 However, it requires little skill to replace these audio samples with others if they are 
stored on a read-write médium. 
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2 I discount concert films, music video, and other forms where the musical score is 
intended to be diegetic. 

3 New sound-panel technology may change this in future; screens may also act as 
loudspeaker transducers able to reproduce sound that is sourced, with varying levels of 
accuracy, from the images on-screen. 

4 The notion that human + environment = ecology makes that assumption explicit. 

5 In the sense that sound will arrive at the player’s ears from a variety of game-world 
directions. 

6 As previously mentioned, modern digital games, in particular first-person- 
perspective games, typically use audio samples. 

7 Within the boundaries of the game’s level design. 

8 For example, a repeated audio sample of birdsong. 

9 Audio compression is a reduction in the dynamic range of the sound and thus a 
reduction in the digital storage required. 

ío See the Gestalt-based discussion given later on the superimposition of presence 
hypotheses and the domination of one over the other. 

íi I am indebted to Hillel Schwartz for directing me to Fried’s fascinating book. 

12 See Helmreich (this volume) for a discussion of immersion in sonified aquatic 
environments. 

13 The notion that, with robots and synthetic characters, fear can be the response as 
the visual representation of the character becomes more humanlike. 

14 This might help explain the ability to engage with and to foreground the virtual 
world of a digital game while simultaneously being aware of the real world in the 
background. 

13 To a certain extent, Schafer’s keynote, signal, and soundmark terminology may be 
related to McMahan’s concept of surities and surprises. 

16 This is an unfortunate conjunction of perception and sensation given the distinc- 
tions noted previously, particularly by Fencott (1999). Carr’s other category is psychologi- 
cal immersion, where the players become “engrossed through their imaginative or mental 
absorption” (2006, 69). 

17 This is not the place to continué the sometimes heated debate on whether digital 
games are an art form, but champions of that position who are looking for academic 
ammunition could do worse than triangúlate between Diderot, absorption/immersion, 
and digital games/art. 

18 Big Fucking Gun. 

19 Accepted that defining these sounds as “vocalizations” potentially classes them with 
other, ostensibly more authentic vocalizations taken from the real world. In this case, they 
have a level of authenticity because of a shared causality and indexicality, which is why 
they are recognized as vocalizations. 

20 As yet unpublished results using other physiological data gathered during the same 
experiment intriguingly point to gender differences, both subjective and objective, in the 
perception and experience of immersion. 

21 What would the ethical considerations of such biofeedback be? 
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CHAPTER 15 


THE SONIC 
PLAYPEN: SOUND 
DESIGN AND 
TECHNOLOGY IN 
PIXAR’S ANIMATED 

SHORTS 


WILLIAM WHITTINGTON 


Introduction 


In 1986 a newly formed, high-tech startup company named Pixar began producing 
animated shorts in order to challenge its designers to develop Computer appli- 
cations that showcased the design possibilities of its new technology — the Pixar 
Image Computer, which could render photorealistic images albeit in a rudimentary 
way. A few years earlier, these same engineers worked with George Lucas and 
Industrial Light 8c Magic (ILM) to develop special-effects software and sequences 
for Hollywood studio features such as Young Sherlock Holmes (1985) and Star Trek II: 
The Wrath ofKhan (1982), for which they created the “Génesis effect” (named for a 
Science probe that generated a planet out of “lifeless rock”). Eventually, Pixar 
transformed from a Computer company to become a leading studio in the field of 
3-D computer-generated animation, and its efforts marked Hollywood cinema’s 
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transition into the digital age. Much like the impact of cinema’s transition to sound 
in the late 1920S, this new focus on computer-based production processes shifted 
industrial practices, established new studio styles, and fostered a new form of film- 
making. This revolution was not entirely image based, however. Concurrently, film 
sound was undergoing a digital transformation of its own with the introduction of 
new sound formats such as Dolby Digital and Digital Theater System (DTS) and the 
rise of the sound design movement, which was driven in part by new portable 
recording and mixing devices, multichannel exhibition, and changes in the mode of 
sound production. 

Using a variety of theoretical perspectives from digital culture, sound studies, 
and traditional film studies, I argüe that as the techniques of computer-generated 
animation developed at Pixar, sound design became an integral aspect of this 
new mode of storytelling and overall filmic design. Foregoing an exclusively ono- 
matopoetic approach to sound, defined by the bangs , foooras, zooms , and honks 
commonly found in traditional cartoons, sound designers at Pixar worked with 
producers, directors, animators, and software engineers to establish an unprece- 
dented unification of sound and Computer imagery by borrowing live-action pro- 
duction techniques and reworking traditional animation strategies for sound use. 
In regard to technology, Pixar’s sound designers adopted new portable recording 
devices for the collection of sound effects, unified these “raw sounds” with the aid 
of sound samplers like the Synclavier and, later, software like Pro Tools, and mixed 
their efforts in the newest multichannel formats typically reserved for feature 
productions. More important, sound designers like Ben Burtt ( Star Wars series, 
WALL-E ) and Gary Rydstrom (Terminator 2: Judgment Day, Saving Prívate Ryan , 
and Monsters Inc .), who worked on many of the initial shorts, fostered audio strate- 
gies that emphasized sound perspective, spectacle (localization of effects and the 
establishment of offscreen space and environmental effects), and “hyperrealism” 
(a technique Rydstrom would later adapt to films such as Jurassic Park and 
Titanic, which were also heavily laden with Computer graphics, or CG). These 
technological innovations and new aesthetic approaches quickly established the 
sound-image relations in Pixar films as cinematically credible and viable for filmgo- 
ers. In short, the sound designs “sold” the images. The short films produced at 
Pixar are important in regard to this trend because they established the aesthetic 
and production patterns that formed the studio’s house style, which eventually 
migrated into the company’s successful feature films such as Toy Story (1995), 
Finding Nemo (2003), and Up (2009). In the short films, which make up the case 
studies for this chapter, sound is not only a dominant formal element, but also an 
important thematic one, principally as it relates to the notion of play. Many of the 
shorts explore childrearing and children’s games as a form of play, the interaction 
of sound and images as play, and the notion of play associated with music perfor- 
mance and the voice. From Luxo Jr. (1986) to Jack-Jack Attack (2005), filmgoers are 
immersed in a kind of sonic playpen, surrounded by innovative sound designs and 
Computer images that have reshaped our notions of cinema and animation in the 
digital age. 
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Sound Design’s Expanding Role in the 

Era of Digital Animation 


Animation i s unique within the mode of Hollywood film production, above all in 
relation to sound. Typically, the overall sound track for any film consists of dia- 
logue, music, and effects (Foley, hard effects, and ambiances), and within Hollywood 
cinema, a hierarchy has formed around issues of the voice, predominan tly focusing 
on narrative intelligibility or who tells the story. 1 For this reason, the dialogue tracks 
form the scaffolding around which the entire sound track for a live-action feature is 
designed. Traditional animation, however, lacks production recordings and some- 
times even dialogue, which is the case in a number of the Pixar shorts. This lack 
offers both Creative freedoms and challenges for sound designers, who must not 
only develop the specific sounds for the film but also establish the overall rhythm of 
the film, which supports the story beats, character actions, and plot points. In the 
2001 Academy Award-winning short For the Birds , director Ralph Eggleston and 
sound designers Tom Meyers and Jory Prum build comedie tensión around the 
squeaks, squawks, and pecking of a gang of birds who want their club to remain 
exclusive. In a rhythmic chant of squeaks (recorded from a collection of “squeaky 
toys”), the birds egg on two of their flock to peck at the feet of a gangly outsider, 
who dangles precariously upside down from the telephone wire on which they are 
all perched (Amidi 2009, 30). Gravity and a well-timed moment of silence on the 
sound track provide the crucial comedie punch line as the birds achieve their 
goal but are launched upward (and featherless) with the sharp sound effect of a 
“boing.” Sound and image work together rhythmically to comment on the absur- 
dity and perils of a mob mentality without ever resorting to the use of dialogue. It 
is important to note that the choice of squeaky toys to create the character sound 
designs accesses nostalgic notions of childhood play and the tactile sensations and 
emotional delight of squeezing these air-filled toys. The visual design of the plump 
little birds reinforces the concept, and, even within the narrative action, the larger 
bird puffs and honks at the air that is filled with tiny feathers from the now naked 
flock. It is a comedie gesture that affirms the themes of air and flight, breathing and 
exhaling, and silence and squawking. In this way, play and peril are linked through 
sound. 

Since its introduction in the 1970S, the term sound design has been multifaceted 
in its application and definition within cinema and sound studies. One of the initial 
definitions of sound design comes from the process of designing specific sound 
effects like those noted earlier. Animation has always been highly adept at creating 
and applying individual sounds for sonic punctuation and comedie effect, borrow- 
ing techniques from radio and the theater. As sound historian Robert L. Mott has 
noted in relation to early Disney animation, “When Mickey Mouse hit a baseball 
over the fence, the sound of the hit was provided by a wooden block, and the 
slide whistle sounded as the baseball went soaring into the air” (Mott 1990, 83). 
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These sound effects displaced realistic sounds to create broad comedie gestures or 
slapstick, which Charlie Chaplin once called “playful pain” (Mott 1990, 80). These 
early sound constructions have since become cliché. While sound designers today 
often apply the same strategies of timing and comedie intents, the sounds are much 
different in their material manufacture, as well as spatial encoding and aesthetic 
design. This new approach is currently supported by sound-editing software like 
Pro Tools, which has Consolidated sound editing, mixing, and previewing into a 
virtual work environment. Within the program, the sound session for a particular 
sequence is laid out horizontally, providing a graphic representation of the sound 
wave forms for each individual effect, which can then be manipulated by using 
various sound filters (high- or low-frequency pass filters, for example) or proces- 
sors. This digital system differs from the previous magnetic film system in that 
effects can be layered without significant noise buildup, sound sets can be created 
and stored within the digital environment for easy recall, and composite sound 
effects can be previewed in stereo or even multichannel formats without delay. 
Gone are the days when sound personnel would have to wait until the final mix to 
hear the composite sound effects. 

In the Pixar short Luxo Jr ., a baby lamp pops a ball, which deflates with a come- 
die whistle of air. Unlike the earlier Disney example, the sound-effects design is 
multilayered. It ineludes the squeak of the plástic as the ball rolls on the hard wood 
surface, the pop of plástic, the expulsión of air through a plug, the deflating whistle, 
and the shifting of the coiled springs of the tiny lamp as it rides the ball down. In 
addition, the effects are recorded in cióse perspective to their sources using a por- 
table Nagra in order to anchor them with a sense of image and sound credibility. 
This is not to suggest that the sound designer just gathers one versión of each of 
these effects and combines them for the desired result. Sound design is a process of 
ongoing experimentation. Gary Rydstrom notes, ÍC I have a PowerPoint to explain 
the design process that I use when I lecture about sound. The first slide reads, Record 
a sound , try it , find it doesnt work. The next card reads, Select another sound , try it, 
find it doesnt work , and so on. I keep doing this until I get to the final card that 
reads, Run out of time 5 (Rydstrom 2010). The design process in the digital age is one 
of nearly infinite choices that are limited only by aesthetic imagination and, of 
course, postproduction economics. While the comedie outeomes remain similar 
between the oíd and the new animation forms in the Luxo Jr. short, the sensibilities 
of design and their organization have shifted to create a heightened cinematic 
reality that resembles live-action sound. 

The development of overall aesthetic sensibility for computer-generated ani- 
mation is in part due to the involvement and sensibilities of specific sound person- 
nel, considerations that are linked to another definition of the term sound design. 
Historically, the term sound design has also been applied to a the planning and pat- 
terns of the overall sound track” by a specific individual (Whittington 2007, 2-3). In 
many cases, the sound designer becomes akin to the director of the sound track, 
making sure that the various film reels exhibit thematic and formal unity. This is 
accomplished by the control of sonic aspeets such as the use of leitmotifs, the design 
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of effects for emotive impact, and attentiveness to mix practices. The key sound 
designers at Pixar have been Ben Burtt, Gary Rydstrom, and Tom Meyers ( Pitch 
Black , Armageddon , and Up). In the Academy Award-nominated short Lifted , the 
film’s director, Gary Rydstrom, makes playful reference to being a sound designer 
by comically drawing parallels between the job of the mixer and the job of a young 
teenaged alien learning to abduct humans firom a hovering spacecraft. In this 
homage to Steven Spielberg’s Cióse Encounters of the Third Kind (1977), an alien 
trainee named “Stu” sits behind a vast consolé of toggle switches (an analogy to a 
sound-mixing board), trying to retrieve his target human from a farmhouse. All the 
while, he is being graded on his very spotty performance by a gelatinous and impas- 
sive instructor (a stand-in for a producer/director). The overall sound-design strat- 
egy is an exercise in cross-cutting, as Stu’s frustrated actions, articulated by the 
sounds of his hands raking the consolé and a Pong-like Computer ping, lead to the 
abducted man being flung about the interior of the home. The sound design illus- 
trates this through various thumps, body hits, and crashes. Causality is the source 
of the comedy, but the analogy to the perilous process of Hollywood filmmaking 
cannot be missed. Sound designers are always under the scrutiny of someone, 
whether it be the producer, the director, or other sound personnel. Finally, the 
instructor intervenes at the consolé and reverses the chaos by returning the abductee 
to his bed. The lesson of this exercise in Computer animation is that at times sound 
and image design are about interplay. As in the performance of a duet, the comedy 
can be drawn from either side of the design equation. For example, when the 
abducted man gets stuck in the tree, the visual track reveáis a static wide shot of the 
tree and a long beam of light emanating from the flying saucer. At the same time, 
the sound track offers the “brutal” sounds of the crunching of leaves and tree 
branches, as well as body parts, as the man gets stuck, clearly offering filmgoers an 
example of Chaplin’s “painful play” (Amidi 2009, 39). 

For Rydstrom, if the sound design is recorded and organized well, it “merges 
into the image, brings it to life,” and the constructions are “not cartoon-y” but 
rather “reality-based” (Kenny 2004, 1-2). This is the foundation of his “hyperreal- 
istic” sound-design style, a philosophy that draws inspiration from hyperrealism in 
visual culture, which is interested in our perception of the “real,” a property that is 
both the subject and objective in hyperrealistic art. In the field of sound design, 
Rydstrom creates stylized constructions that access familiar sounds (raw sound 
“events”), yet he recombines these effects to create seamless sound impressions that 
are viscerally dynamic, anthropomorphic, and preoccupied with codes of height- 
ened cinematic realism often found in live-action film. His goal is not to create an 
exact duplicate or simulation of a sound event, but rather to provide a lie that tells 
a dramatic truth. In unpacking this lie, we find cinematic codes of spatial and tem- 
poral design, genre expectations, and the engagement of an array of psychoacoustic 
properties of sound. Visual equivalents of these properties (mainly an attentiveness 
to movement, space, and temporality) have been a goal of animated Disney films 
from the very beginning. Digital theorist Andrew Darley notes, “Precedents can be 
found in Disney Studio’s attempts, from the late 1930S, to mobilize certain of the 
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existing aesthetic codes of classical narrative cinema (live action) and to intégrate 
them in a more rigorous fashion than had hitherto been the case within drawn 
cartoon form” and in doing so provided a “heightened realism” (Darley 1997, 19). 
Rydstrom adapts this approach to sound design. 

Early in their careers, many of the key sound designers at Pixar were associated 
with San Francisco Bay Area film-production companies and sound houses such as 
Sprocket Systems, later redubbed Skywalker Sound. They have collaborated with 
various directors such as James Cameron, Francis Ford Coppola, John Lasseter, 
George Lucas, and Steven Spielberg. These sound-conscious filmmakers encour- 
aged and in many instances demanded innovations in relation to sound technol- 
ogy, mixing, and design. As a result, the dudes of sound-effects recording, editing, 
Foley, and mixing became less constrained in terms of the división of labor within 
the mode of production. This shift aróse in part as independent production units 
adopted collaborative strategies that grew out of production approaches from film 
schools and very low-budget productions in which every member of the crew was 
required to perform one or more tasks. According to Rydstrom, this environment 
fostered an important “mentoring” network for a core group of sound designers, in 
which attentiveness to sound experimentation and technological innovation was 
the norm, unlike the hierarchical system within the classical Hollywood mode of 
production (Kenny 2004, 1). John Lasseter, Pixar’s chief Creative officer and one of 
its most important directors, agrees with this assessment: “As a manager, it is my 
task to abolish hierarchies” (Reis 2009, A 6). This sense of collaborative creativity in 
the field of sound was carried over to the newly formed Pixar Animation Studio and 
integrated well into its Creative environment, which was attempting to blur the lines 
between filmmaking and computing. 


Technological Innovation and 
Animation in Transition 


The exchanges between technological development and aesthetic design are at the 
core of the Pixar philosophy, and they have deeply influenced the image and sound 
styles that have emerged at the studio. According to Lasseter, “Art challenges tech- 
nology, and technology inspires the art. That’s it in a nutshell — the way we work at 
Pixar” (Milsom 2008, 08:50 min.). One of the key pieces of sound technology 
adopted very early by Pixar sound personnel was the Synclavier, a synthesizer that 
was modified with a Winchester diskette add-on and floppy drive (and later a hard 
drive) to function as a sound sampler that could hold a library of “raw” materials 
for instant access and manipulation. According to Rydstrom, “With sampled 
sounds in RAM, you can instantly pitch-bend it and layer it and play it without 
using any processing time” (Kenny 2004, 3). In his short Lifted , Rydstrom worked 
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with a similar yet far more advanced software-based system to build the frustrated 
vocalizations of the alien trainee, “Stu,” using sampled growls and vocalizations 
from his own dog, Sparky. In the short film Tin Toy y the Synclavier technology 
brought together all of the sounds for the one-man band “Tinny,” including his 
horn, drum, and cymbals. During the production process, the Synclavier allowed 
the consolidation of the duties of a sound recordist, editor and mixer, which sup- 
ports the aims (and definition) of the sound designer. Among sound scholars and 
practitioners, there is consensus that technological developments such as the 
Synclavier and Digidesign’s Pro Tools formúlate an essential part of the definition 
and process of sound design. In fact, sound designer and editor Walter Murch 
(THX1138, The Conversation ) first associated the term with his experience using the 
quadraphonic exhibition technology during the development of the sound track 
for Apocalypse Now (1979). As he used the multichannel technology to map out the 
sound fields established by the various speakers in the motion picture theater, he 
would drape or “design” sound effects and music like a set designer in order to 
establish an immersive experience for the filmgoers. 

Historically, sound technology has never been fixed or standardized within the 
mode of production, however. From the very earliest attempts at film sound syn- 
chronization, the development and use of sound advances have sprung from a 
complex set of drivers, including economic competition, licensing and patents, 
exhibition quality control, and, perhaps most important of all, the particular needs 
of a specific film production. Innovation often results from logistical and produc- 
tion challenges that arise from a particular story that is being told. Similarly, inno- 
vation within the field of animation has not been a fixed or formal process. The 
needs of animated stories have often driven the development of specific audio and 
visual technologies and techniques. In the 1940S, for example, Disney developed the 
multiplane camera setup, an animation stand that allowed cels to be divided into 
layers to simúlate depth of field, and “these [cels] could be moved frame by frame 
at varying rates toward or away from the camera, giving a powerful illusion of glid- 
ing through a three-dimensional space” (Thompson and Bordwell 1994, 261). This 
technique was developed for use on films such as Pinocchio (1940) and Bambi (1942), 
and it arguably set the stage for the development of 3-D exhibition technologies, 
which have become part of the production and exhibition strategies of the current 
computer-animation cycle. In regard to sound technology specifically, Disney also 
worked with RCA at this same time on the development of a new multichannel 
sound presentation for Fantasía (1940), which featured what many describe today 
as the precursor to “surround sound.” The goal was to sepárate various orchestral 
instruments to match the movements of the characters within the animated musi- 
cal sequences of the film. This effect was achieved by using two sepárate but inter- 
locked optical tracks. In one configuration scheme, the speaker array consisted of 
fifty-four speakers throughout the theater and provided one of the first multichan- 
nel experiences for filmgoers. While the system was never adapted as an industry 
standard, it did present the possibility of a new cinematic-sound experience that 
would inspire companies like Todd AO and later Dolby to pursue multichannel 
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exhibition possibilities (Blake 1984, 20). The coupling of image and sound develop- 
ments presented a unique opportunity for Disney to reshape not only animation as 
form but also the visceral experience of cinema itself for filmgoers by creating new 
immersive environments for audiovisual play. 

In their move to lead Hollywood cinema into the digital era, Pixar brought this 
approach up to date by embracing the power of computing for both image and 
sound production. They also coupled these developments with new exhibition for- 
máis from multichannel to 3-D presentation but always within the context of the 
character-based stories they wanted to tell. The blending of technological concerns 
and design considerations is never easy, however. As Steve Jobs has noted, “Pixar 
did an impossible thing . . . It blended the Creative culture of Hollywood with the 
high-tech culture of Silicon Valley . . . The best scientists and engineers are just as 
Creative as the best storytellers, just in different ways ... The Pixar culture, which 
respects both, treats both as equals” (Paik 2007, 295). Central to Pixar’s success 
has been the ability to forge not just a shared vocabulary between the two groups 
but a successful workflow as well. It is also important to point out that, at Pixar, 
technology is valued as a necessary component of the design process, not simply a 
high-tech item or piece of software to be purchased. 

To present a more complete understanding of the interplay between Computer 
graphics and sound, it is necessary to specifically address the background related to 
the integration of computer-generated images into the process (and cultural under- 
standing) of animation and live-action filmmaking. In my previous work on sound, 
I have discussed the integration of multichannel technology into the Hollywood 
studio system, and I have noted that within the film industry, technological changes 
have encountered significant challenges as concerns are often raised regarding eco- 
nomic viability, issues of standardization related to exhibition and production 
practices, fears of diminished quality control, and lack of widespread testing of 
potential success (Whittington 2007, 28). The introduction of computer-generated 
imagery software and hardware by Pixar into the production process faced all of 
these concerns. Pixar’s cofounder and president, Dr. Ed Catmull, has noted that 
during the period of early development in the 1970S and 1980S, both “artists and 
studios disliked and feared computers. . . . But even in that environment, a few 
small groups had the visión to use the Computer for picture making. At that time, 
we thought a lot about what it would take to make the process economical and 
practical” (Street 1995, 79). Ironically, Disney, in particular, was eager to intégrate 
computers into their highly successful 2-D animation production units — but also 
wary. Fearing a critical backlash about declining quality due to technological 
expedience, Disney did not publicly admit to using computers until the DVD release 
of Beauty and the Beast. However, during the 1980S, Disney worked closely with 
Pixar to create CAPS (Computer-Assisted Production System), a “digital ink-and- 
paint system that employ[ed] a sophisticated multiplane camera within a digital 
environment 55 (Street 1993, 79). The system shifted the labor-intensive process of 
inking and painting of cels to the digital realm. Ultimately, the Disney executives 
knew that computers would streamline the production process for animation; 
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however, their fears about Computer technology were realized somewhat when 
computer-animated productions eventually supplanted 2-D animation techniques 
and practices in popularity. 

While computer-generated images transformed the field of animation, the 
technology also had an equally profound impact on live-action films and their 
cultural reception by filmgoers in terms of expectations of credibility and verisi- 
militude of audiovisual design. As part of their technological focus, Pixar research- 
ers began developing new software, such as Motion Doctor, MenV (Modeling 
Environment) and notably RenderMan, in order to create virtual characters and 
environments and to intégrate or “composite” computer-generated images with 
live-action footage. With this innovative software, artists could engage in a image 
Processing,” specifically modeling photorealistic images, characters, and sets and 
incorporating realistic surfacing, lighting effects, and motion blurs that simulated 
the way photographic film captured action in motion. Early examples of the soft- 
ware’s importance can be seen in the water-pod effects in The Abyss (1989), the 
liquid-metal effects in Terminator 2: Judgment Day (1991), and the design of the 
dinosaurs in Jurassic Park (1993). The software continúes to be the foundation of 
most special-effects software today, and this technology arguably enabled the cin- 
ematic adaptation of books and graphic novéis that Hollywood producers once 
considered unfilmable — firom the Lord of the Rings series (2001-2003) to Watchmen 
(2009). Not surprisingly, one of the early adopters of the software was George Lucas, 
who pressed the technology into use to create his densely layered Star Wars pre- 
quels. As many crides have noted, this intervention raised the important question, 
In this new age of computer-generated images, what is animation? 

Between the use of Computer graphics for specific special effects and its use to 
create virtual sets, characters, and action, a blur between the categories of the live- 
action film and animation began to occur. In turn, this breakdown shifted the cul- 
tural reception ofvarious image-sound relations, mainly related to “special effects.” 
Michele Pierson, the author of Special Effects: Still in Search of Wonder , argües: 
“If an effect is only special in relation to something else — something that it isn’t — 
how do viewers decide what is a special effect in this context? Does the scope for the 
kind of transmutation of the visual field that might make an effect special even 
exist once a film begins to be made o ver in the mode of an animated feature?” 
(Pierson 2002, 152-53). Initially, filmgoers and critics often contained the “work” of 
computer-generated images within the category of spectacle or special effect and 
addressed specifically the verisimilitude and authenticity of particular construc- 
tions. In short, filmgoers and critics were asking, “Does this special effect look 
like an effect?” And for the longest time, because of the rudimentary nature of 
the Computer graphics (which were heavily support by sound), this was an easy 
question to answer. 

However, this question lost it authority as the integration of live-action and 
computer-generated images became more refined and seamless and as computers 
have made their way into every aspect of the mode of production firom color 
timing to film printings. As a result, the clear distinction between live-action and 
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computer-generated animation has fallen away. Contemporary blockbuster films 
such as Spiderman (2002), The Hulk (2008), and Avatar (2009) regularly combine 
both forms, and the result has been a reshaping of cultural expectations around 
image-sound relations for these types of films. The debates have moved from 
questions about credibility to questions of immersion, visceral spectacle, and 
emotional resonance, as they support a new “cinema of sensation.” This shift 
has also been supported by the rise in consolé video games, which offer various 
game environments that regularly blur the lines between live-action and animation 
without significant objection or notice from gamers. Historically, Pixar’s films 
benefited greatly from this blurring of the lines. Specifically, computer-generated 
animation is no longer marginalized into the category of cinematic gimmick 
or novelty; rather, it has established itself as a unique form of filmmaking. As direc- 
tor Brad Bird argües, “Oftentimes people cali [computer] animation a genre, 
and that’s completely wrong. It’s a médium that can express any genre” (Corrlis 
2004, 80). 

In addition to creating a new form of filmmaking, computer-graphics technol- 
ogy has even challenged the hierarchies and divisions of labor that once separated 
the areas of visual design such as lighting, set design, cinematography, and costum- 
ing. Today, one artist with a Computer workstation and the right software can per- 
form all of the tasks of set designer, costumer, cinematographer, and even lighting 
gaffer if needed. In many ways, the collapse of these duties and the merged produc- 
tion processes mimic the way in which sound design emerged as a new mode of 
production. It is historically important to note that sound design leads the way in 
theory and practice in this era of visual culture. I would even go so far as to argüe 
that sound design is the crucial factor that has fostered the new perceptual gestalt 
of computer-generated animation in the digital age — a model of understanding 
that computer-generated images cannot fully achieve alone. Despite the efforts of 
Computer images to seamlessly bring together collages of unexpected elements — 
animated or live-action — the current historical poetic of cinema in the digital 
age demands that the sound design present equally refined constructions to estab- 
lish cinematic verisimilitude. In the article “A Back Story: Realism, Simulation, 
Interaction,” Andrew Darley provides a systematic history of visual digital culture 
and even specifically addresses Pixar’s position in the development of visual “tech- 
niques (programs) ... for the various phases and procedures involved in three-di- 
mensional modeling, animation, and rendering,” but, like many others, he does not 
consider the role of sound or music (Darley 2000, 20). One of my aims in this chap- 
ter is to offer a counter-history that ineludes sound design in the discourse of the 
digital media. 

But this raises the question, How do these computer-generated images and 
sound design come together to be “true?” As various case studies of Hollywood 
studios in the past have shown, new production processes and shifts in technology 
are often contained within aesthetic and stylistic perimeters set by the studios 
or production units. These factors become part of the development of a “house 
style,” which filmgoers eventually recognize and come to expect from that studio. 
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Pixar’s films established not only a new and innovative visual style but also a sonic 
style that has defined its house style. 


Pixar Studio Style: Character 
Animation, Storytelling and 

SOUND DeSIGN 


From the very inception of Pixar, the founders were determined to focus their 
efforts on creating character-based films by using Computer animation. According 
to Toy Story director, John Lasseter, “The animation was the groundbreaking thing 
about this movie; but our intention was for the audiences to get so caught up in the 
story and characters that they would forget the animation” (Street 1995, 91). This 
move was in part facilitated by an acceptance of the limitations of the Computer 
technology. It is an axiom within filmmaking that if filmgoers become invested in 
the plight of characters in a story, they will forgive all manner of technical limita- 
tions — and early Computer animation faced many limitations. The processing 
power and memory of the early hardware was severely limited. Specifically, because 
processing speeds were slow, longs hours were required to render a single image, 
and, once rendered, the image files presented significant storage and access prob- 
lems. More important, the software was not adept at creating “organic” images, 
particularly in regard to modeling animáis with fur or human figures (Corliss 2004, 
80). For these reasons, the initial short films, like The Adventures of André and 
Wally B (1984) and Luxo Jr. (19 86), featured characters that were “geometric” and 
relatively smooth in their visual design (Corliss 2004, 80). In order to mask the 
rudimentary nature of these images, the filmmakers at Pixar employed a host 
of cinematic strategies from an emphasis on character-driven stories and genre 
elements specifically drawn from the tropes of comedy and drama to experiment 
with formal aspects of the médium, such as rhythmic editing and camera moves, 
innovative music placement, and hyperrealistic sound design, which established 
the Pixar house style. 

Their first short, The Adventures of André and Wally B, is a foundational exam- 
ple of this new style. The film, which was created for the SIGGRAPH (Special 
Interest Group on Graphics) conference in 1984, features a character named André, 
whose blissful afternoon is disturbed by a friendly and playful bumblebee, Wally B. 
In an unexpected homage, the character’s ñames are derived from the 1981 Louis 
Malle film, My Dinner with Andre , and the short also gives a nod to the existential 
questions raised in the Malle film by referencing the playful nature of connections 
between people that are both pleasurable and painful (Amidi 2009, 14). In this 
short, a collaboration of animation styles — both oíd and new — are evident. The 
plot is a simple chase in which the anxious André distraéis Wally B in order to run 
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away, thus avoiding a painful sting. However, Wally B zooms after his new friend 
and delivers an inevitable offscreen sting. Floriane Place-Verghnes, author of 
Tex Avery: A Unique Legacy , places this idea in historical context: “The chase is a 
recurrent theme in the Averyan corpus and, generally speaking, in the cartoon 
industry of the late 1940S” and is primarily “an element of acceleration of the rhythm 
which can lead to total madness” (Place-Verghnes 2006, 137). In this way, the chase 
calis up the codes of the madcap comedy, a genre category defined by comedians 
such as Buster Keaton, Charlie Chaplin, and Laurel and Hardy. 

While the sound ratchets up the comedie pace through an ever-increasing 
tempo related to both the music and the sound effeets, the characters are the center 
of attention. Sound design is crucial in creating the integrity of the characters and 
the comedy of this short, as it both humanizes the characters and lends live-action 
credibility to their actions. One moment in the film is particularly telling: When 
André awakens, he comes alive with the sound of a yawn, a scratch of his belly, and 
a yowl that shivers through his frame. The sound design has done what the Com- 
puter images cannot: It has rendered the character’s internal structures (specifically 
through the breath and vocalizations from his lungs and mouth), his surface 
texture (through the sound of his hand on his belly), and the integrity of his body 
(the elasticity of his frame and spine through his throaty and quivering vocaliza- 
tion). The sound designer for this short, Ben Burtt, based his sound-design philoso- 
phy on a balance between how an effect might sound in the physical world with the 
overall dramatic needs or “truth” of the cinematic construction, and both of these 
factors establish the credibility of this character within the computer-generated 
environment (LoBrutto 1994, 142). 

Lest we forget, this is also a chase comedy, and some elements of traditional 
cartoon sound and music strategies are blended in as well to transcend the rudi- 
mentary image design and to update familiar cinematic gestures as a form of 
acknowledgement and homage. In particular, Wally B’s introduction is character- 
ized by the sound of his flapping wings and buzzing, but these sounds are super- 
seded by the self-reflexive placement of the music from “The Flight of the Bumble 
Bee.” It is clearly the sonic setup for a joke, which will end with a sting. During the 
chase, sound and picture editing play a crucial role as the camera angle shifts to a 
bird’s-eye view, and Wally B’s flight sounds undergo a metamorphosis into the 
sound of a propeller-driven dive bomber dropping down to deliver its payload. 
This sound design calis on a long history in animation, in which sounds of the 
natural transform into the mechanical, and the humor is evoked as the “character 
loses his fluidity and becomes stilted” (Place-Verghnes 2006, 137). The audio and 
visual denouement of the short calis on another staple of animation — the role 
reversal, which is illustrated when an offscreen André (just stung) hurls his hat at 
Wally B and hits him on the stinger, knocking him out of frame with an onomato- 
poeic “boing.” The evocation of the history of animation style and the new atten- 
tiveness to sound-design elements such as sound perspective, localization of sound 
on- and offscreen, and codes of live-action verisimilitude establish a new gestalt of 
understandings and expectations for filmgoers. As Ben Burtt notes, “It’s forging 
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those connections between familiar sound and illusionary sound that I think is the 
basis of the success for a lot of the sound that sound designers have put into these 
[Pixar] movies” (Milani 2008, 1). 

Pixar’s style expands on animation tradition by embracing anthropomorphism 
and recasting it to assist in the design of character “speech,” which often mimics 
human sound patterns and behaviors and even combines human sounds as familiar 
anchors. In the short film Luxo Jr., Gary Rydstrom offered director and animator 
John Lasseter an example of this new sound-design strategy: 

I wanted to give the lamps in Luxo Jr. character through sound. I told John that 
Pd come up with these voices. He’d never imagined they’d have voices and was 
wary of the idea. But I experimented with taking real sounds — a lot of it as simple 
as unscrewing a light bulb or scraping metal. Every once in a while, a sound 
would be produced that would remind you of sadness or glee . . . It felt like the 
birth of something new, even then. (Kenny 2004, 2) 

This design approach established an emotional vocabulary for the characters 
through sound effects, thereby offering an analogy to language. In the film, a tiny 
Luxo lamp plays with a beach ball while being supervised by a parent lamp. After a 
mishap that pops the ball, the adult lamp admonishes the pint-sized Luxo with a 
snap of attention and a wag of its head. The sound design for the adult lamp focuses 
on punctuating the neck or joint movements of the apparatus, specifically the sound 
of the snap of the joint and its silence, then a wagging and clicking of metal, an 
audiovisual gesture that thematically supports the lamp’s role as guardián and 
responsible adult. The visual design reinforces this idea as well by incorporating a 
light source that shines down on the tiny lamp like a spotlight on the bad behavior. 
Luxo Jr. deflates with a musical whine accompanied by a deep sigh of metal as if 
stifling tears. In terms of narrative storytelling, the short taps into the archetype of 
parent-child dynamics, while also offering a comment on the roller coaster of emo- 
tions that young children experience from moment to moment, following the 
ecstasy and joy of play to the deepest sadness from a sense of the loss of a beloved 
toy. In this way, the story is thematically about the play of emotions. 

According to Lasseter, this short film solidified Pixar’s philosophy about 
film sound and storytelling: “Gary’s brilliant work made those lamps so real, so 
believable. It taught me that sound has an incredible ability to be a partner in the 
storytelling of a film, and ever since then Pixar has put a lot of emphasis on thinking 
of the sound as we develop our stories” (Paik 2007, 72). It is, therefore, not surpris- 
ing that the Luxo Jr. lamp has become the unofficial mascot of the company and 
even forms the “I” in the company’s logo before all of its feature films. 

Character-based design strategies are evident in nearly every aspect of the film- 
making process at Pixar from behavior modeling to set design, yet the philosophies 
that drive these constructions can be traced back to sound design. As John Lasseter 
notes: 

Our philosophy for the set [design] carne from Gary Rydstrom, our sound 
designer ... He taught me long ago that, in doing sound effects, if a ball bounces, 
you don’t just record the sound of a ball bouncing — because when the sound 
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effect is cut in, it won’t sound like it should. You have to make it bigger. To create 
the bark of a dog in Toy Story , Gary combined dog sounds with tiger sounds to 
make it bigger and more impressive. That’s the philosophy we used in the look of 
the film. We went beyond reality, caricaturizing to make it more believable. 

(Street 1995, 83) 

This philosophy encouraged artists to consider both the images and the sounds as 
characters in the film. One of Rydstrom’s most important innovations at Pixar was 
to establish this sense of “hyperrealism,” in which sound and images take on múl- 
tiple layers of meanings and emotion by concentrating not simply on aspects of 
film form but also on emotional intents, expectations, and intertextual connections 
such as historical homage. For Rydstrom, this often meant displacing a sound from 
its original context, augmenting it, then reinserting into a new sound-image pairing 
to create a heightened effect like the tiger sounds inserted into the bark of a dog. 
However, rather than entirely stripping the sound of it previous meanings, Rydstrom 
was borrowing various qualities of the realistic or raw effect and effectively splicing 
them into a new creation that filmgoers would crane their ears to recognize and 
accept. For Rydstrom, sound design was the key to establishing the credibility of 
Computer animation: “It was clear to me that this form was a whole new thing that 
would require a whole new approach to sound” (Paik 2007, 72). 

The philosophies of hyperrealism became firmly established at Pixar in the 1989 
Academy Award-winning short Tin Toy , which became the inspiration for Toy 
Story . The plot centers on “Tinny,” a mechanical versión of the one-man band, and 
his first encounter with a curious baby. Terrified at the prospect of being drooled 
on, Tinny eludes baby “Billy” and hides under a couch, which is where all wide- 
eyed toys seem to go to elude destructive children. But when the baby stumbles 
during the chase and begins to cry, Tinny emerges to enchant the child with his 
musical abilities. The moment is fleeting, however, as the baby quickly moves on to 
something more enticing — the discarded packaging for the toy. It is a humorous 
commentary on the attention span of children. It is also the perfect film to exploit 
the thematic notion of sound design and music as a form of play. 

By today’s standards, the computer-generated images in this short appear 
somewhat crude as the baby’s oddly textured skin makes it appear more monstrous 
than perhaps intended, but both the character-based storytelling and the sound 
design transcend these deficiencies. Once again, there is no dialogue in the film; 
rather, the images and sound evoke the sense of humor and peril. In dramatic terms, 
the filmmakers cleverly devised Tinny to be his own worst enemy. His very nature 
is sonic by the fact that various musical instruments are woven into his jacket. His 
movements are, therefore, musical — so any chance of a stealthy retreat under the 
couch is impossible. This type of connection between movement and sound in ani- 
mation has become known in animation as “Mickey Mousing,” whereby the image 
and sound elements are connected in self-aware synchronization, and, of course, 
the process is named after the famous Disney character. 

While this comedie idea is drawn from traditional animation, it has been com- 
pletely redesigned for computer-animation. Using the Synclavier, Rydstrom created 
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a complex database of sounds, including gears, cymbal crashes, drum hits, and 
horns, as well as eyelid flutters, shivers, and breathing. The instruments were not 
simply just recorded toy versions but they included also a series of real instruments 
of different scales, which were recorded with a resonant dynamic range and stacked 
as needed. In establishing the sound design pattern, Rydstrom composed the sound 
in layers and engaged them at various tempos, so they not only realistically fit the 
movement and actions of the character but also serve as score, which heightens 
the frenetic sense of peril. According to Rydstrom, “The complication for sound 
in Tin Toy was to make it sync up with the animation . . . John [Lasseter] didn’t 
anímate the cymbal and drums with the idea of what music he would play” (Amidi 
2009, 24). 

Along with representing the musical exterior of the character, the sound design 
also illustrates the inner life of the character through the emotive use of sound. In 
particular, when the baby falls and begins to cry, all eyes turn toward Tinny, and the 
accusation is registered through character reaction and sound. With an immediate 
click of his cymbal, a blink, and the clicking of his neck gear, Tinny turns and, in 
surprise, realizes what he has done. Shame overcomes him, and he lowers his head; 
his accordion deflates with a sound like a mournful groan. In these instances, sound 
provides the emotional language that renders the inner States of the character, 
which in turn tells the filmgoers how to feel. 

In pursuit of an overall unified design, Rydstrom sets these sounds in relief 
against a bed of immediately recognizable vocal and ambient effects, always balanc- 
ing traditional animation strategies and live-action expectations. In particular, 
Rydstrom recorded and edited the vocalizations of a real baby to support the rudi- 
mentary images of the “monster child” in the film rather than employing an actor 
to perform these sounds, which was standard practice in animation at the time. The 
child’s coos and giggles are recorded and edited to mimic a documentary aesthetic 
style and even inelude an unscripted sneeze, which was animated into the character 
design. Rydstrom further heightened the live-action qualities of the environment 
by implying offsereen space by means of sound effects. Through an open doorway, 
the sound design implies the audio from a televisión set as someone channel surfs 
briefly before finally tuning into to familiar game show The Price Is Right. The 
recording features a sense of spatial encoding. The televisión effect is recorded from 
the perspective of the room on-screen, implying distance, and the chatter is also 
compressed and muted to give it a sense of the size of the room and the televisión 
speaker. Both of these sound constructions serve to create a psychoacoustic gestalt 
of a living space that is unseen by the camera, and the rapid editing of contení 
implies the presence of a person who is changing channels, perhaps a parent just 
home from a trip to the toy store who is taking a break from child care. The balance 
of this pattern of effects creates a heightened sense of cinematic reality, effectively 
smoothing over the technical wrinkles in the computer-generated animation. 
Rydstrom reiterates the importance of Tin Toy : “I think it’s the most sound- 
intensive movie per square inch that iVe ever done” and for Pixar revealed the 
importance of sound design in “shaping the contení of the film” (Paik 2007, 72). 
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In any discussion of Pixar’s house style, it is essential to discuss the role of both 
music and the voice; however, these topics could merit another chapter entirely. 
Instead, I briefly explore how Pixar’s approach has reshaped the functions of both 
of these elements to fit their design philosophies. In the 1989 short film Knick Knack , 
director John Lasseter paid homage to Warner Bros.-style cartoons by presenting 
the story of a snowman who is trapped in a snow globe but is eager to join the 
frivolity of a shelf full of warm-weather knickknacks. The visual design evokes a 
“retro vibe” that draws on trends present in “19505’ modernism” (Amidi 2009, 25). 
However, the music in the film is not orchestral as was traditionally the case in 
the early Warner Bros, animated films; rather, it features the instrument of a 
single voice, specifically that of performer Bobby McFerrin. McFerrin, who is 
perhaps best known for his hit “Don’t Worry, Be Happy.” He typically employs his 
entire body in a composition, using his voice to follow the melodic line while 
employing his chest to beat out the rhythm section and when necessary, using edit- 
ing and re-recording techniques to fill in the gaps. His vocal style and his technique 
are akin to animation because they draw on the strategy of anthropomorphism to 
recast the voice in place of the various orchestral instruments. True to the Pixar’s 
sound style, this composition, which draws on 1950S’ bebop becomes an idiosyn- 
cratic character within the piece by providing a human rhythm that activates the 
computer-generated images. Specifically, the various characters, including a cactus, 
a pyramid, and a pink flamingo, “come alive” and sway to the beat of the music, 
which is situated as both score and source. 

Within the hierarchy of image-sound relations, Pixar’s use of music in this 
short flips the audio paradigm to resemble that of the musical. According to anima- 
tion historian Scott Curtís, this is a similar strategy engaged by early Warner Bros, 
cartoon shorts: “Given that the tempo of the music has already been decided upon 
in any given cartoon [as a result of the music selection], the C mise-en-scéne’ enacts 
that tempo in a variety of ways . . . characters sing and . . . buildings also sway” 
(Curtis 1992, 200). This acknowledgement of the power of music to drive animation 
became a factor in the audio and visual design of subsequent shorts, specifically 
Boundirí and One Man Band. However, Pixar has staunchly avoided allowing music 
and the conventions of musical genre to become the solé driving factor of anima- 
tion in its feature films; rather, it has remained committed to a more blended 
approach that balances the use of sound design and music in the storytelling pro- 
cess. In this way, the company has distinguished its house style from that of Disney’s 
successful 2-D animated units, which produced musicals like Beauty and the Beast 
(1991) and Aladdin (1992). At the end of Knick Knack , Lasseter and Rydstrom even 
draw our attention to the power of a blended approach by offering a transgressive 
use of “score,” which comically comments on the fate of the written word in light 
of potential computer-generated animation. While the credits roll, a voice reads 
along, presenting not a recitation of the written text but a rhythmic reading of 
the phrase “blah blah blah.” The use of the phrase is both musical and satirical 
as it is a playful reference to both the ineffective nature of voice-over and a sense 
of nostalgia for what many filmgoers might recognize from Peanuts cartoons as 
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“adult speak,” which, like most film credits, is something that fails to hold anyone’s 
attention. 

This is not to say that Pixar has given up on speech and language entirely. 
Rather, the shorts and the Pixar house style eventually do find a place for both, but, 
once again, the filmmakers rethink their use in light of the particular needs of Com- 
puter animation as a new film form. Just as Rydstrom’s philosophy of hyperrealism 
borrowed the codes and emotional components of specific sound effects, so Pixar’s 
use of the voice attempts to do the same by borrowing the credibility of celebrity 
voices to bring its computer-designed characters to life. With the release of the 
short Mike’s New Car (2002), Pixar moved the production of their shorts in a new 
direction by specifically linking them to their feature-length productions through 
voice casting and character crossover. In this instance, the buddy team of Sully 
(voiced by John Goodman) and Mike Wazowski (voiced by Billy Crystal) from 
Monsters , Inc. comes together for a test drive of Mike’s new vehicle, which not so 
surprisingly ends in madcap mayhem as the car’s gadgetry gets the best of them. 
The voices bring a star quality that serves as a recognizable anchor for both the 
production design and the filmgoers. According to animator John Kahrs, “John 
Goodman’s vocal performance was really rich and had a lot of range . . . There is a 
resonant warble to his voice, almost bear-like, and it fits the character so well. 
I would get direction from his performance and know exactly how the eyebrows are 
going to move and what the emotion of the scene is going to be” ( Monsters , Inc.: 
Production Notes). By contrast, Billy Crystal established Mike Wazowski with vocal 
qualities of an East Coast origin, specifically by engaging in fast-talking New York 
manía, which is incorporated into the physicality of the character through frenetic 
arm and leg gestures. According to lead character animator Andrew Gordon, “Billy 
would take a line and go off on lots of tangents with ad-libs and comedy routines” 
C Monsters , Inc.: Production Notes). 

The voice itself becomes a kind of special effect, which is much like the image- 
sound relations found in puppetry. Sound theorist Michel Chion has argued that in 
the 1970S the nature of the voice in Hollywood cinema changed significantly as 
filmgoers were made aware of its constructed nature in films like The Exorcist (1973) 
and Star Wars (1977), in which the voices seemed “stuck on” to various characters 
in makeup or masks (Chion 1999, 164-65). Chion makes the connection directly to 
puppetry: “We’re constantly aware that voices are grafted onto bodies, only tempo- 
rarily on loan” (Chion 1999, 154). Pixar’s house style acknowledges this hyperaware- 
ness and in fact depends upon the filmgoer’s knowledge of these actors 5 personas, 
past work, and vocal qualities to lend credibility to their virtual incarnations. 
Technically, these efforts are further supported by animation software that enhances 
vocal synchronization in relation to the characters 5 on-screen facial gestures. As 
sénior animator Peter Docter (director of Monsters, Inc.) noted in relation to the 
voice and animation style on Toy Story , “We have a program that enables us to look 
at a sound wave and break it down into frames. I listened to the sound over and 
over again, then did an assessment of the pitch of the words” (Street 1995, 85). These 
cues then became the reference points for facial gestures and body movements, 
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while “lip-synching was facilitated by a library of mouth poses that could be used to 
form the various sounds” (Street 195, 87). 

This is not to imply that Pixar has allowed the voice to take complete priority 
in the design process as it might in a live-action film; rather, the vocal design 
supports the overall filmmaking philosophy and house style. One of the ways in 
which Pixar achieves this goal is to direct voice talent toward a particular kind of 
skewed readings. As a result, many of the vocal performances are mannered and 
caricaturized somewhat with an elastic quality. Billy CrystaPs performance is a 
prime example. His vocal timing and rounded enunciations are presented in fits 
and starts much like someone learning to drive a stick-shift car. They beg to be 
animated in the form of a giant green eyeball with a large and expressive mouth, 
evoking the performance codes of puppetry. Once again, this approach has been 
formulated somewhat around the limitations of the animation software, which 
despite years of development still has difficulty rendering the human form. 

Pixar has worked around these limitations by embracing a unique style of 
human character design that focuses on geometric shapes and patterns similar to 
those found in comic books. This style is evident in the crossover short Jack-Jack 
Attack (2005), directed by Brad Bird (The Incredibles ) . The storyline features another 
child-care situation, in which preteen Kari (voiced by Bret Parker) must contend 
with the developing superpowers of Jack-Jack, the offspring of two superheroes. 
Unlike the baby in Tin Toy , Jack-Jack’s skin is smooth, and his head is round with 
a triangular sprig of hair on top. Kari’s facial features are similarly variant, with 
particular emphasis on her elastic mouth and braces. As with Billy Crystal, Kari’s 
vocal performance hinges on idiosyncratic timing from the lethargic to the manic. 
The comedy resides in the scale of the performance contrasts, which range from the 
naturalistic to the cartoony. In this way, sound takes on hard edges that match the 
geometric visuals. Bird notes that, in designing these characters, he strived to make 
them both “caricatured and believable,” an approach that “Disney used to cali . . . 
The plausible impossible’” (Corliss 2004, 80). In the end, this assessment is 
an apt description of the overall audiovisual design philosophy and house style of 
Pixar Animation Studios as it incorporates both an understanding of the animation 
traditions of the past and a visión of computer-generated animation in the future. 


CONCLUSIONS 


Pixar Animation Studio continúes to use the development of short films as a train- 
ing ground for new producers, directors, animators, and sound designers. Each 
effort continúes to provide the company with valuable research and Creative out- 
comes. Pixar’s house style, therefore, is by no means fixed; rather, it continúes to 
develop under the influence of new technologies and personnel. One glimpse of the 
future, however, may be found in Pixar’s support of research on sound synthesis at 
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CornelFs Department of Computer Science, where Computer graphics researchers 
are developing the equivalent of computer-imaging software for sound. Using 
physics and software modeling, researchers are simulating the sound of water, ren- 
dering noise vibrations, splashes, and even the formulation of bubbles in synchro- 
nization with computer-generated images of water provided by Pixar as if both 
existed in the real world (Steele 2009, 1). If widely implemented, the implications of 
synthesized-sound software could shift both the production of animation in film 
and video games radically. For instance, sounds in video games could be pro- 
grammed for real-time activation and origination, coming firom specific on-screen 
actions or environmental sources rather than from sound files for sound effects and 
dialogue that are recalled by the program and repeated based on activation points 
or predetermined gestures. Sound and image could occur in real time, just as they 
do in the physical world. This approach would strengthen the immersive quality of 
video games, which has long been one of the primary projects of game develop- 
ment. In animation development, sound and image constructions could be designed 
together by the same software and designer, a development that could once again 
lead to a further collapse of duties and hierarchies within the mode of production. 
Ultimately, the roles of the animator and the sound designer could merge, offering 
the potential of not only a significant shift in Pixar’s house style but also a realign- 
ment of the cultural reception and our expectations related to sound and image 
design in the digital age. 


Table 15.1 Online Resources 


Cornell Harmonio Fluids Project 

http://www.cs.cornell.edu/projects/HarmonicFluids 

Digidesign (Pro Tools) 

http:// www.digidesign.com 

Dolby Laboratories 

http://www.dolby.com/index.html 

FilmSound.org 

http://filmsound.org 

Pixar Animation Studios 

http://www.pixar.com 

Pixar’s RenderMan 

https://renderman.pixar.com 

Skywalker Sound 

http://www.skysound.com 

William Whittington, PhD 

http://web.me.com/williamwhittington 


NOTE 


1 Foley is the term given to those effects created on a sound stage in synchronization 
with the picture. Ambiences are layers of background noises, which often form environ- 
mental aspects such as busy Street noise or waves on a beach. 
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CHAPTER 1 6 


THE AVANT-GARDE 
IN THE FAMILY 
ROOM: AMERICAN 
ADVERTISING 
AND THE 
DOMESTI CATION 
OF ELECTRONIC 
MUSIC IN THE 1960S 

AND 1970S 


TIMOTHY D. TAYLOR 


Sound doesn’t knock on your door — it comes right in. 

— Raymond Scott, “Raymond Scott 

Sounds Off on Sound” 


I would like to acknowledge Suzanne Ciani for her willingness to be interviewed, 
Bob Kosovsky at the New York Public Library for his help with the Eric Siday Archive, and 
the many participants at the Sound Studies conference in Maastricht, the Netherlands, in 
November 2009. 
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Introduction 


One of the interesting aspects of music in advertising in the United States is that 
modernist musical techniques or idioms — atonality, widespread use of dissonance, 
complex rhythms, and so forth — never really found their way into the production 
of this music. Many writers of advertising copy who aspired to become a “serious” 
novelist got their start in advertising, and the same is true of many visual artists (see 
Bogart 1995). But nothing resembling musical modernism in the sense described 
here ever found a place in advertising music. 

With one exception: Some new sound technologies found their way into adver- 
tising music quite quickly, following fairly closely behind their use in avant-garde 
music. Advertising musicians’ early adoption of electronic technologies meant that 
many Americans, unaccustomed to listening to avant-garde music of any kind, first 
heard sophisticated new electronic technologies such as the Moog synthesizer 
through less prestigious venues such as soundtracks to Science fiction films and 
televisión and radio commercials. To be sure, many had probably heard the 
Theremin and other electrical sounds, but the invention and widespread use of 
electronics in the 1960S marked a new era of the proliferation of such innovative 
sounds. No less a figure than Robert A. Moog wrote the following in 1977: “The 
listening public first became aware of the elctronic [sic] médium subliminally, 
through radio and TV commercials.” Moog mentioned some of the figures I discuss 
later (Moog 1977, 859). 

While this may seem strange, in the period when electronic technologies were 
appearing, advertising budgets were nearly unlimited (see Taylor n.d.). Advertising 
agencies spared no expense in hiring musicians, who were thus able to purchase 
these expensive technologies. Eric Siday, one of the first to use electronic music 
commercially, was paid about $5,000 per second (more than $33,000 today) in the 
mid-i96os (“Swurpledeewurpledeezeech!” 1966, 68). 

Today, Americans are surrounded by electronic sounds of all kinds: as recorded 
music, as heard in films and televisión, as telephone hold music, or as the opening 
sound splash that accompanies powering up a personal Computer. But such sounds 
were not always so ubiquitous or accepted. The early history of electronic music in 
the United States in the post-World War II era, whether in the realm of art or com- 
merce, is a history of suspicion, derision, and confusión. Countless árdeles in the 
popular press critiqued or ridiculed the music and frequently greeted it with incre- 
dulity or scorn. 

This chapter, then, offers a history of domestication, the domestication of 
sounds that were initially associated with Science fiction but fairly quickly found 
their way into televisión commercial pitches for anything from coffee to beer. These 
sounds might have originated in public cinemas, but they became ubiquitous in 
family rooms across America. Helpful here is the Science and technology studies 
(STS) work on the concept of domestication (Oudshoorn and Pinch 2003; 
Silverstone and Hirsch 1992). Domestication in the STS literature tends to focus on 
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technologies, but here we are dealing with sounds, producís of technologies. 
The focus, unlike for Silverstone and Hirsch (1992) is not the household but the 
Computer music studio and the laboratory. 

I use the domestication concept in three ways. The first is the most literal — 
the adoption of sounds associated with Science fiction to use in selling producís 
in everyday life. However, I also mean it to describe the complex set of pro- 
cesses behind the scenes by which electronic sounds were harnessed for use in 
selling. How did electronic musicians convince potential clients in and out of the 
advertising industry that such sounds could be accepted for use in advertising? 
Finally, I intend the term to refer to the ways in which inanimate commodities were 
thought to become friendly products for consumers, brought to life by electronic 
sounds. 


Electronic Music: First Associations 


It is perhaps not surprising that the first main uses of electronic technologies for 
making music outside of the rarefied realm of art music composition were in sci- 
ence-fiction films. Electronic music could represent the future as depicted in such 
films, a future that included “music without musicians,” as the title of one article on 
electronic music put it in the late 1950S (Tall 1957). Electronic technologies in sci- 
ence-fiction films date back to the 1950S. Louis Barron (1920-1989) and Bebe Barron 
(1925-2008) are best known for composing the soundtrack to the film Forbidden 
Planet in 1956 (Barron and Barron 1989). As a result of this film, their prominence 
made them sought-after musicians for electronic music in commercials. The way 
their work was discussed helped contribute to popular ideas about electronic music, 
as this quotation from Mademoiselle in 1959 shows: 

Louis and Bebe Barron create film scores with electrons, activated into electronic 
circuits that seem to express emotional characteristics. When stimulated, the 
circuits respond with meaningful sounds, as if alive; the film track is composed 
by altering and editing them on tape. The eerie results communicate emotions 
without the intervention of symbols. (“Music of the Future?” 1959, 94) 

For a long period after the rise of electronic music in advertising in the 1950S 
and into the 1960S, advertising-music composers had to contend with the public’s 
perception of such music as strange, fit only for representing the unreal and hor- 
rific. In a speech on October 16, 1969, Eric Siday said, “the failure of electronic 
music to, as yet, escape the confines of the experimenters, space film sound tracks, 
and the world of the avant garde for much wider acceptance is that the listener has 
not only been expected to accept a new and strange sound — but to learn a whole 
new musical language .” 1 In addition, Raymond Scott, a composer/inventor who 
was another of the earliest composers of electronic music for commercials, said in 
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1962 that, to the public, electronic music “sounded like nuclear war and at its tamest 
like ‘outer space’ music.” 2 However, he continued, “If that was all you could do 
with electronic music, I’d say it had no place at all to speak of in TV background 
and scoring” (liner notes to Manhattan Research Inc. 2000, 122). 

Scott’s fears were reasonable. Early coverage of electronic music in the popular 
press in the United States frequently played up its seemingly otherworldly or hor- 
rific attributes. A 1961 article about an electronic music concert quoted a woman 
from the audience: £C A 11 those spiders in the sky!” (Bowers 1961, 60). An article from 
1962 admitted that electronic music “ may sound like static from outer space ” and 
stated that sinus tones have a “spooky impersonality [that] carries an implicit con- 
notation of space-travel, music from Mars, the machine personality” (Moor 1962, 
49, 50; emphasis in original). Another article that same year noted composer 
Vladimir Ussachevsky’s annoyance at the terms “push-button music” and “space 
music” (“The Sound of Hell,” 1962, 86). In 1968 an article by an anonymous writer, 
surveying twenty years of electronic music, noted, “Electronic music had its uses. 
Its Science fiction sounds were wonderful for TV commercials and moving-pictures 
seo res” (Discus 1968b, 157). 3 A few months later, this author praised what he believed 
was a new trend in electronic composition, moving away from “abstraction and the 
science-fiction sound” (Discus 1968a, 165). Cartoons in the popular press through- 
out the 1960S showed images such as dancing robots (Moor 1962, 49) and a man in 
tails at a piano but looking at an electronic display instead of at the music on the 
stand (Berger 1969, 46). 

Some writers did believe that electronic music was no passing fad, for, in the 
words of one critic, “too many serious people are working on it, too many rockets 
are shrieking up into the skies, too many children are watching TV.” He lamented 
the cliché — even in 1961 — that associated electronic music with outer space (Bowers 
1961, 61). 

Despite the associations that the general public had for electronic music, Scott 
nevertheless felt that the unfamiliar sounds could sell producís in novel ways. 
A demo track titled “Don’t Beat Your Wife Every Night!” consists of a series of 
Scott’s audio logos (about which more later) with a voice-over of sample advertis- 
ing copy, a sort of rehearsal for the use of electronic sounds in commercials. In a 
1962 lecture Scott recounted how he assembled this track: 

After we put together the tape, we called in an announcer-friend of ours, “Bucky” 
Coslow. We said, “Listen to these electronic effects, we’ll play them one at a time 
and whatever they make you think of — commercial-style — say it real spontane- 
ous-like. We’ll record them, then later, we’ll have a mix and see what happens.” 

We did just that, with one difference: The announce tape was edited but before 
we’d got a chance to sync it with the effects tape, it was run purely by accident at 
random against the effects tape. The effect was startling. Words and phrases that 
had no business showing up where they did against certain electronical effects 
took on a wonderfully convincing and attractive quality and seemed to indicate 
that electronic music for this purpose may turn out to have unusual vitality, 
conviction, and atmosphere plus a rather shocking flexibility. (liner notes to 
Manhattan Research Inc. 2000, 111) 4 
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Scott also said, “Maybe the reason these effects are as attractive as they seem is 
because we are as yet not preconditioned to a species of electronic music, and con- 
sequendy our ears are ready to accept a more abstract marriage between the spoken 
word and electronic musical effects. But regardless, we believe that possibilities are 
most exciting” (liner notes to Manhattan Research Inc. 2000, 131). Thus, even com- 
posers had to convince themselves of the utility of these new sounds for commercial 
purposes. 

AUDIO EXAMPLE 16.1 


The Rise of Electronic Music 

IN COMMERCIALS 


The decade of the 1960S was one in which the postwar rise of American consump- 
tion was reaching new heights, as is well known. Thanks to the Servicemen’s 
Readjustment Act of 1944 (popularly known as the GI Bill, which made it possible 
for returning veterans to obtain an education or a mortgage), an unprecedented 
number of new houses were built in the 1950S, houses that were bigger than those 
most Americans had lived in before, marking both increased affluence and the 
increase in consumption of goods, which necessitated more space in which to hold 
them. A new and relatively prosperous middle class emerged for the first time in 
American history. 5 

The advertising industry grew along with the American population and its 
increasingly voracious consumption habits; billings to clients for advertisements 
increased by more than 30 percent every five years following World War II, with the 
first half of the 1960S increasing by nearly 78 percent, the largest increase in the 
couple of decades after the war. 6 

The rise of what is known in the industry as “parity producís” (those that are 
essentially similar, such as soft drinks or fast foods) carne to rely heavily on music 
to help create an impression of difference among similar producís (see L. Cohén 
2003 and Taylor n.d.). Budgets for the production of music grew, including, for the 
first time in any significant way, electronic music, once composers began to under- 
stand how to employ electronic music in commercials. The next step was to con- 
vince potential clients that such music could benefit advertising representations 
and sales of commodities. 
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Raymond Scott: Moments 

of Freshness 


Raymond Scott (1908-1994) was one of the first advertising musicians to compose 
music for commercials using only electronics, beginning late in 1960, for major 
national brands such as Vicks, Lever Brothers, Alcoa, Hamm’s Beer, and Parker Pen 
(Chusid 1999, 10). 7 

Scott (born Harry Warnow) was a Juilliard-trained musician who wrote and 
arranged jazz tunes for his own quintet and served as the musical director for Your 
Hit Parade on CBS radio. His long interest in electronics led to the founding of 
Manhattan Research, Inc., in 1946, a company that specialized in the development 
of electronic devices for musical use. He began producing recordings for a variety 
of purposes, including, beginning around 1960, advertising. At one point, the firm 
billed itself as Designers and Manufacturers of Electronic Music and Musique 
Concréte Devices and Systems. The use of the term musique concréte shows some 
cognizance of the European avant-garde of the 1940S and 1950S, which Eric Siday 
(discussed later) shared. 

Scott believed that he had found a way to domestícate electronic music, to 
employ it in everyday usages such as commercials that obviated its association with 
“nuclear war” and “outer space music.” He argued that, unlike in science-fiction 
films, electronic music: 

can be used in a light way, and in many mood-provoking ways, where instead of 
frightening the audience it will entertain them and help put over points and 
create memory images that are much more striking than could be done with 
conventional instruments or even with an unconventional handling of conven- 
tional instruments. Electronic background scoring is not intended as a replace- 
ment for conventional orchestral ensembles, but for certain moments of 
freshness, for certain kinds of products, and especially for an Audio Logos idea 
where a very special sound appears together with the appearance on screen of the 
product, or the sound or punctuation or some kind of audible underlining of a 
video moment, an announce moment or a combination of both has extremely 
powerful possibilities. (liner notes to Manhattan Research Inc. 2000, 135-36) 8 

For Scott, whose music bears a closer sonic relationship to the avant-garde than 
Siday’s, the problem of “lightness” was a significant issue in the process of making 
the music approachable and domesticated: 

I do believe that we have licked the problem of lightness in electronic music and 
that I wouldn’t be at all surprised as these months go by that the various things 
that we can do with it will become so wide in scope that there’s a good chance 
that we would be hard put to find something that electronics couldn’t do 
something for. (liner notes to Manhattan Research Inc. 2000, 136) 

In 1937 Scott released a promotional LP titled The Jingle Workshop. The 
text accompanying the recording referred only briefly to the use of electronic 
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technologies, showing just how carefully electronic sounds had to be presented to 
potential clients in this era: 

THE JINGLE WORKSHOP recognizes the growing need for a thoroughly 
qualified, completely staffed musical organization, equipped to apply a 
combination of special talents and professional skills to the production of 
successful TV and Radio Jingles. 

THE WORKSHOP feels that certain elements are vital to assure success in 
such a strongly competitive field: 

First — the experience and ability to analyze and satisfy the individual needs 
of each product. 

Second — the flair for brilliant, provocative, yet basically simple — 
composition of music and words. 

Third — the talent for “exact touch” orchestration. 

Fourth — the conducting skill to achieve a winning instrumental 
performance, perfectly balanced with vocal style and diction for proper 
product emphasis. 

Fifth — the electronic “know-how” to apply recording techniques and 
equipment with imagination. 

Sixth — the overall showmanship in taste and approach, to blend these many 
elements into an effective expressive performance. 

THE WORKSHOP sincerely believes that it has these aims in its operation, 
further 

THE WORKSHOP would welcome the opportunity to discuss your individual 
needs — either for new commercials or for fresh, exciting production of our 
existing jingles, (liner notes to Manhattan Research Inc. 2000, 24) 

The machine Scott used for composing was nicknamed “Karloff.” It drew upon 
two hundred different sound sources and employed several electronic tone genera- 
tors. Pitch, timbre, intensity, tempo, accent, and repetition were controlled by a 
panel. According to a 1960 magazine article, the machine: 

can do virtually anything. It can sound like a group of bongo drums. It can give 
impressions which suggest common noises. It can create the mood of musical 
tone-poems. And it can also take the advertiser’s theme music and produce 
limitless emotional variations on it to suit a variety of musical styles — all, of 
course, if Scott is at the Controls, (liner notes to Manhattan Research Inc. 

2000, 51) 9 

An article on Scott from Advertising Age in 1962 said that the average American 
was exposed to more than five hundred commercials per week (one of many such 
articles from this decade that worried about advertising “clutter” on the air, which 
clearly influenced Siday as well) and said that Raymond Scott had a way to cut 
through it: “Grab ’em by the ears!” — by using audio logos, “new plástic sounds,” 
and “electronic abstractions,” according to the bemused Advertising Age author 
(McMahan 1962, 119). 10 

These new electronic sounds were thought to be able to capture listeners 5 atten- 
tion in ways that more conventional sounds could not. Some of Scott’s descriptions 
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of various advertisements illustrate his approach. About a commercial for County 
Fair Bread in 1962, he said: 

We were invited to do an electronic music score of an animated TV spot on 
behalf of County Fair Bread. The problem: to create an electronic musical 
impression of a calliope playing: “Where, oh where (in this case) is my County 
Fair Bread?” Compose a “man in a white suit”-type theme, for the central 
character, a slightly knocked-out magician. Put together a group of electronic 
impressions of typical animated cartoon sound effects. Here are the sounds we 
carne up with: 

• Sound of the calliope; 

• Magician/“man in a white suit” theme; 

• Cartoon effects. (liner notes to Manhattan Research Inc. 2000, 130-31) 

The resulting commercial sounds like a John Cageian tape experiment such as 
Variations IV , with the jingle treated to a number of effects, sometimes being warped 
and, to my ears, parodied. It is astonishing that the client approved these sounds, as 
advertising clients, then as now, are notoriously conservative and deeply afraid of 
offending anyone. One can only surmise that the music was heard as appropriate to 
accompany the visuals. 11 
AUDIO EXAMPLE 16.2 


Scott’s discussion of an Auto-Lite spark plug commercial in 1961 provides 
another justification of electronic sounds: that they could both animate and sup- 
port the images in a commercial: 

Every time you saw the Auto-Lite spark plug, you saw it spark and the explosión 
that followed. We were able to create a distinctive sound that was somewhat 
realistic, somewhat impressionistic and you not only saw the spark and the 
explosión, but the attractive nature of the accompanying sound made a fine part 
of audio identification with that high-point in the video part of this commercial. 
Effecting an impressionistic interpretation of situations can frequently make 
possible a stunningly attractive effect to go with a certain moment in a TV 
commercial where, something that hasn’t got a sound can have a sound riding 
“piggy back,” so to speak, on the general action — which, with exposure, becomes 
the audio-frequency memory of the “spot” in question. (liner notes to Manhattan 
Research Inc. 2000, 135) 


AUDIO EXAMPLE 16.3 
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Despite Scott’s concerted efforts to employ electronic music in commercials 
in ways that were palatable to his listeners — to domestícate electronic music — he 
was less successful in wining national accounts than his main competitor in 
this era, Eric Siday. Scott, an inventor as well as a musician, seemed to be too 
wedded to the sounds his machines could make to become as famous as Siday, who 
conceptualized his electronic sounds first and foremost as acoustic ones. 


Eric Siday: Identitones 


London-born Eric Siday (1905-1976), a jazz musician who emigrated to the United 
States in 1939, began working in the advertising music world in 1949 (Cross 1968, 
59). Unlike Scott, Siday became spectacularly successful. In 1966, Time magazine 
reported that about 80 percent of the U.S. population had heard at least one of 
Siday’ s compositions, making him one of the most frequently played composers in 
the world and one of the best remunerated, as we have seen. Time said that Siday’s 
signature music work was based on the principie that a product would stick in 
the consumer’s subconscious better if accompanied by an “instantly recognizable 
musical trademark” (“Swurpledeewurpledeezeech!” 1966, 68). 

Perhaps the most famous commercial Siday wrote was for Maxwell House 
Coffee in 1959 and first broadcast on August 10 of that year. 12 This commercial 
re-creates the sound of percolating coffee not just as sound but as a tune almost as 
memorable as a good jingle. Hugely successful, the commercial was played on and 
off into the 1990S ( cc ‘Perking Pot’ Theme Back at Maxwell House” 1990). Siday 
claimed, “the Maxwell House ‘perking’ sound is considered one of the break- 
throughs in the field of broadcast logos.” 13 Elsewhere, he described it thus, in a 
manuscript that is undated but likely from the late 1960S: 

[T]his commercial for coffee? — No pictures of cans of coffee. No newlyweds at 
the breakfast table, not so far, even a mention of coffee! ... A character was 
created; and starred. It was a Coffee Pot . . . 

As the camera slowly panned down, it revealed the coffee pot, shiny and 
elegant, perking away, but not with the real sound of coffee perking; each little 
perk had a quasi musical tonality that was a calculated characture [sic] of the real 
sound . 14 

The creation of the character of the perking pot and the use of music to serve as its 
soundtrack helped these sounds gain acceptance. It is likely that many viewers did 
not understand this music to be the product of tape manipulation, particularly 
since it functioned as a musical illustration of moving images. 15 


VIDEO EXAMPLE 16.1 
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Unlike Scott, Siday’ s music seems to have been conceptualized at least in part 
as tonal music, and he wrote at least some of his electronic compositions in conven- 
tional notation, perhaps in order to copyright them. As was the norm in the busi- 
ness at that time (and the present), copyrights were assigned to advertisers. Notated 
scores were usually just a few bars of the most memorable melodic portions of the 
jingle. The notated score to “The Perking Coffee Pot” was a shorthand versión of 
melody, just twelve bars long (plus six more of slow-moving accompaniment), and 
none of the nuances provided by his electronic gear are notated or indicated. By 
contrast, the versión used in the commercial lasts nearly the entire length of the 
sixty-second advertisement. 

With the enduring success of this jingle, which was contained on many demo 
tapes and mentioned in many promotional materials — often as the lead example of 
Siday’s prowess and fame — Siday began to write electronic sound logos for use in 
advertising to identify a product in sound. He also began writing snippets of sound 
to announce a news broadcast or weather repot or sports item in radio broadcasts. 
He called these sounds “Identitones,” an updated term from “Identitunes,” which 
was his earlier, acoustic music venture, begun in 1957 and suspended in 1964. In 
mid-1966 Identitunes became Identitones, Inc., and was clearly an electronic music 
endeavor, as Siday’s standard contract indicates: “Identitones, the originator and 
owner of a copyrighted package of electronic sounds consisting of various sounds 
which may be used in radio and televisión as promotion aids to identify a radio 
and/or televisión stations(s) and its various program features (i.e., weather, news, 
time), hereby gives and grants to the LICENSEE . . .” 16 Siday provided sound logos 
for many major American and multinational corporations such as Westinghouse, 
ABC Radio, ABC Televisión, American Express, and Sprite. Siday wrote in a 1969 
speech called “Electronic Music in Communication,” “I think the most significant 
valué I have found in my use of electronic music is it’s [sic\ application to the logo , 
and I consider this almost an art form in itself.” 17 

Siday composed a sound logo for Westinghouse on October 17, 1963, according 
to the date on the score, which is only six measures (though the static part of it 
ineludes the notation “electronic gliss: [Key of F]”) and for which Siday was paid 
$6,000 for the first six months’ use alone. 18 This became another well-known Siday 
composition that was included on many of his demo tapes, and the copyright saga 
around it reveáis the difficulties of claiming ownership for these new kinds of com- 
positions. Conventionally notated but electronically realized, such compositions 
were a sort of hybrid of electronic and tonal acoustic music, which made Siday’s 
compositions less difficult to accept than Scott’s in this period. 


AUDIO EXAMPLE 16.4 
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Yet Siday ’s electronic sounds were different enough that Siday believed they 
could be used to cut through the advertising clutter of his day, a concern of many 
in this period. “The world is satiated by conventional music. To grasp a listener 
today, you have to give him something new.” For him, the means to this was through 
the “art of miniaturization — saying something that instantly stands for a corpora- 
tion’s personality” (“Swurpledeewurpledeezeech!” 1966, 68). Identitones produced 
fifty “sound images” that radio stations could use to identify themselves. Siday 
thought that the similarity of radio programming made it difficult for listeners to 
tell stations apart. Identitones consisted of station identifications that presented the 
station’s cali letters in a dozen variations, as well as various sounds that could be 
used to introduce weather reports, traffic alerts, and so on. Wrote Time: 

On rainy days, announcers give the weather forecast to the background refrains 
of electronic wind and rain, which comes in three intensities, drying up, drizzle 
and drench. Warm summer nights are depicted by impressionistic bullfrogs 
and nightingales, coid winter days by chilling quivers and twangs. 
(“Swurpledeewurpledeezeech!” 1966, 68) 

Siday says, “It’s all subliminal. The imagination of the listener can run riot”; he 
believed that a You just can’t get a good drenching rain sound with an orchestra” 
(“Swurpledeewurpledeezeech!” 1966, 68). 

Like other early electronic music composers in the commercial realm, Siday 
faced incredulity from some poten tial clients, as measured both by his promotional 
material, which emphasizes time and again the benefits of using electronic music in 
advertising and sound logos, and by some of the feedback contained in his files. 
Siday’s promotional material frequently refers to jingle fatigue, that listeners are 
being “jingled to death.” A document from around 1970 says, “The constant repetí - 
tion of a jingle may quickly amount to boredom, creating a continuous need for 
costly refreshment.” Siday’s Identitones could come to the rescue since their elec- 
tronic sound made them seem fresh: “The wide variety of ideas with uniqueness of 
sound provided by the Identitones technique has proved to have the lowest ‘wear 
factor’ in the industry. There are instances of these sounds still in network use after 
five, eight, or eleven years.” 19 

Moreover, like so many electronic composers, Siday emphasized the variety of 
sounds available to electronic composers: 

There are literally thousands of electronic sounds available that advertisers are 
not aware of, some of which might be valuable. . . . Electronic music offers a new 
spectrum of colors in sound, simply by manipulating known sounds like the 
guitar, a bird cali, or a machine with electronic equipment. Changing the speed of 
the tape, filtering, and editing all help to produce new sounds from oft-used 
sound sources. Puré sounds, made from electronic equipment like the oscillator, 
can also be used, alone or integrated. 20 

Another benefit, which appears in several promotional documents, is that Siday’s 
electronic works (and, of course, those of others) do not require the labor of live 
musicians, which means that this music is cheaper to produce and residuals do not 
have to be paid to singers. 
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Yet skittish potential clients feared that the sounds were too strange. Siday 
received a letter from Robert B. Jones Jr., vice president and general manager at 
WFBR radio in Baltimore, dated November 4, 1965: 

We are very excited about this project, but at the same time realize that it offers a 
tremendous challenge both to you and to us. The very technique of your “elec- 
tronic music” is so unique and different that it becomes terribly important that 
we not get too sophisticated, cute or off-beat in the content. In other words, our 
audience consists of slob housewives like Jane who must be able to both dig and 
enjoy what we’re doing without a degree from Julliard [sic\ or RCA. Since we 
must do this without pictures or lyrics, I don’t think we should shy away from the 
obvious or the corny, provided it’s good corn . 21 

Radio station WFBR became Siday’s first Identitones customer. 

Siday also faced the opposite reaction from those who knew a little about tech- 
nology. In the mid-^óos Siday employed a salesman, Charles Barclay, who visited 
the major media markets in the United States with Identitones demos. A contact 
report by Barclay from January 1967 at station WSB in Atlanta says, “Played tape for 
Elmo Ellis and 2 others (Hill & Shaffer). He liked it but not overwhelmed. He’s 
fooled around with audio oscillator and thinks weTl be quickly imitated.” 22 

Still another issue that emerges from Siday’ s documents concerned communi- 
cation. Siday would customize Identitones packages for customers (for a price, of 
course). However, clients were not always conversant enough with electronic tech- 
nologies to know how to ask for the sounds they wanted or whether what they 
wanted was technologically possible. A letter from a production manager at WQUA 
radio, Moline, Illinois, dated January 24, 1968, requested that Siday modify an 
existing package to match the station’s updating of its campaign: 

We’d like to give a slightly different twist to the I.D.’s ... a longer intro to the I.D. 
itself and a fast stinger [a short piece of music that breaks up a radio program]. 

The intro to build excitement for the stinger which is the actual “taste” so to 
speak. The intro should be long enough for the announcer to say, “Get rid of 
those blues. Have a Taste of the New Life on WQUA (sig). ... I think that the 
major difference in what we want and what we now have is “excitement”; we’d 
like a little more! Kind of like a whirling cloud of sparkling gold dust to lead into 
the sig . 23 

Once electronic sounds began to become normalized and catch on, Siday had cli- 
ents who wanted to convert their acoustic advertising music into electronic. 
A query to Siday’ s publicist resulted in the following response in August 1972: 

Yes, it [the client’s acoustic jingle] could be translated electronically, but there are 
several reasons why I don’t think you should attempt it. 

First it would have to be seo red (same as any musical number). Your original 
charts would help a lot, of course. Then, each electronic element or “instrument” 
would haveto [ sic ] be recorded a single track at a time and mixed through filters, 
spectrum shaper and other electronic devices. This is a long process of trial 
and error. The only simple phase is the final mix which now has a series of 
programmed tracks ready to blend. 
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In other words, it would be as complete as an entirely original score — maybe 
more so because of the specific confines of the theme line. Eric’s minimum fee for 
such a project is $5,000, plus recording charges. 

But even more important is the fact that due to a lack of a simple recurrent 
short musical phrase in the song, Eric doesn’t feel that it will accomplish the 
memorability element that makes the Siday sounds work. The jingle arrangement 
is too sophisticated and subtle, unless an IDENTITONE is put in over the master 
track, and this would be entirely too busy. 

What might be possible, however, is to experiment with simply tagging 
[adding to] that jingle as is with a single IDENTITONE. Cost: $ZERO, 24 

Siday and Scott, working independently as competitors in a growing field, were 
well aware of each other’s work. They faced many of the same problems in attempt- 
ing to promote their music for use in something other than Science fiction. Each 
had to attempt to convince both advertising agencies and advertisers that electronic 
music could serve their purposes without sounding freakish or off putting. Scott 
was the more daring of the two sonically, though with less long-lasting success. 
Siday’s solution was to write compositions that were largely electronicized versión s 
of music that could have been executed by acoustic instruments, though with elec- 
tronic flourishes. Despite Scott’s modérate and Siday’s big successes, electronic 
sounds were still far from being ubiquitous. Later composers still had to grapple 
with ignorant or reluctant clients through the 1970S and into the 1980S, a period in 
which the use of electronic music was still the province of the technically inclined 
and the specialist composer, not yet mainstream. 


Electronic Music in the 1970S 


Siday’s well-publicized successes in the world of commercial music and the fame he 
received from it helped recruit other musicians into the realm of advertising music. 
Composer/sound recordist Bernie Krause recounts the following: 

While studying electronic music at Mills College [in the mid-i9óos], I read an 
article about a man in New York who had been paid $35,000 for seven seconds of 
music for an American Express commercial. Five thousand dollars a second for 
six notes! Impulsively, I bought an airplane ticket and found myself heading east 
to meet this composer. 

“This composer” was Siday, who had purchased the second prototype by Robert 
Moog, which he had used to make that commercial. Viewing the synthesizer on the 
table, Krause became “absolutely convinced I was seeing the future of media music 
unfolding before me” (Krause 1998, 43). 25 

After meeting Siday, Krause and his partner, Paul Beaver, visited Robert Moog 
to purchase one of his synthesizers. Their goal was to write advertising music like 
Siday but on the West Coast, where they were based. Krause found it a difficult field 
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to break in to, for most of the potential clients didn’t want synthesizer music. 26 But 
he had some successes. Krause found that while clients were initially reluctant to 
hire a synthesizer composer, once they dipped a toe in the water, they all wanted the 
same sound that they had heard on pop records (Krause 2009). 

Doubtless partly a result of the massive success of Wendy Carlos’s Switched on 
Bach álbum of 1968 (Carlos 2001 the first classical álbum to generate sales of half a 
million, eventually going platinum) by the late 1960S, electronically generated 
sounds were viewed less as strange and more as novel and interesting. Employment 
of the Moog or other synthesizers became a selling point for advertising music 
composers. Thus, by the early 1970S, the use of electronic sounds was becoming 
increasingly common. Advertising music composers discussed the benefits of new 
technologies, invoking the dominant language of the industry — mood, but not for- 
getting the earlier associations of this music. Edd Kalehoff, a composer-performer, 
said the following in 1973: 

In commercials, I’ve found the Moog adds a strong sense of the energy that takes 
place when music joins picture. If s a special tool that can dramatize certain 
moods, something out of the ordinary, or very strange or very grotesque. If s 
great for electronic effect or logos, and it can replace an entire band, but it isn’t a 
magic instrument to solve all problems and you can’t save money using it to get a 
big band sound. (Owett 1975, 13) 

A Schafer Beer commercial the same year recycles that company’s jingle 
and features Kalehoff playing the Moog synthesizer. The ad introduces Kalehoff 
(“Edd Kalehoff at the MOOG synthesizer”) and shows him playing the instrument, 
turning diais and adjusting the machine in other ways, and opening a can of beer. 
A 1984 ad for Kalehoff s Services in a trade magazine shows him wearing sunglasses 
and sitting at two Moog keyboards; the top of the ad reads “There’s more than one 
way to play it,” above a logo that reads “Moog Synth King.” The ad’s copy sells the 
Moog as much as Kalehoff: 

Play it legato, ostinato, brassy, funky, highbrow or high-tech. No matter how Edd 
Kalehoff plays it, whether it be on this model, which he helped design with Bob 
Moog 15 years ago, or combined with keyboards, rhythm, voices and effects, a 
satisfying sound can be created to enhance any visual. Edd’s classical background 
lends itself to orchestral arrangements which, combined with electronics, 
produce the richly rewarding sound that sells. Edd has composed about 2001 
commercials 8c themes. 

Edd and his staff, with their extensive TV production expertise, are an asset 
to any size project, from an entire TV station music pack to the voice of Q-Bert . . . 

WARNING: THE SURGEON GENERAL HAS DETERMINED THAT EDD 
KALEHOFF’S MUSIC CAN BE HABIT-FORMING. 27 
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Still, synthesizers were novelties in this era. They were used by music technol- 
ogy specialists and employed partly for new sounds and also as a cost-saving device 
for advertising agencies; an article in the late 1970S said that the upright piano was 
still “standard office equipment in the jingle industry” (Gorfain 1979, 51). 


SlJZANNE ClANi: POET OF SOUND 


While Kalehoff might have gained some measure of fame from his use of the Moog 
synthesizer, the musician who became the most celebrated user of electronic tech- 
nologies in advertising music after Eric Siday was Suzanne Ciani (1946-). Ciani, in 
contrast to Siday and many others, used the Buchla synthesizer, which, unlike the 
Moog, did not have a keyboard. 

Ciani told me in an interview that she broke into the business by devising 
sounds for some fifteen-second televisión commercials for Macy’s Department 
Store. She composed these at the studio at Mills College, where commercial work 
wasn’t permitted, so “I took the Scripts back with me and hid in the Mills College 
studio” (Pinch and Troceo 2002, 163). There, she says: 

I got to interpret what I saw poetically in sound. So if it was a commercial for a 
fur coat, I made [what was] to me the sound of a fur coat. If it was a key chain, 

I made the sound of a key chain. ... I looked at these things and saw a different 
dimensión in them. It wasn’t just music, it was some kind of poetic interpretation 
of the visual that was also included. So there were notes, but there was sound 
also. So I loved doing that work. I just, I thought it was very original. And when 
I got to New York, and I was down and out and wanted to work more on my art 
and said, “Okay, you need money.” I thought, “Well, advertising is a good 
resource because I will still be in the domain where I want to work.” And it 
turned out to be a great advantage for me because ... I got to work in the top 
studios with very high budgets, with the top musicians. It was ... a very 
well-supported field. (Ciani 2004) 

Ciani says that it was the cost of owning her own synthesizer in the early days 
of the instrument that impelled her to enter the world of advertising. “Here I was 
making $3 an hour and within a year I had an $8,5000 Buchla. That was the begin- 
ning. But it did necessitate finding a commercial outlet to pay for my habit. So that 
was when I started doing commercials” (Milano 1979, 33). 

Trevor Pinch and Frank Troceo describe Ciani taking her Buchla into a com- 
mercial recording studio with the commercial producers present; they didn’t know 
what to make of the machine. According to Ciani, c Td walk into a studio without a 
keyboard and they’d go, you know, like they didn’t know what to do, how to use it, 
what to write. Some of them just said, ‘Do whatever you want . . . make the sound 
of a spaceship, make the sound of whatever’ ” (Pinch and Troceo 2002, 166; ellipsis 
in original). 
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In an industry that has always employed ordinary nontechnical language to 
describe music — since most advertisers and advertising agencies are not musi- 
cians — the difficulty of finding language to describe synthesized sounds was a 
problem in the early days of synthesizers: 

So you come up with a sound and if you touched one knob, suddenly everything 
was different. And these producers who didn’t know how to talk, nobody had the 
vocabulary for describing sound, he’d say, “No, no, go back, go back to where 
you were.” So I’d move he knob back and he’d say, “No, no! It’s not the same” 
because there were so many interactions — there were maybe fifty knobs contrib- 
uting to one sound. The guy used to hit my hands — whenever he liked it [and] 

I’d move it, he’d say, “Stop! Don’t touch that, don’t you touch another knob! 

Okay, record.” (Pinch and Troceo 2002, 166-7) 

Oíd ideas about what electronic music could illustrate persisted. According to 
an article in the trade press, Ciani was known for a time as a “disease composer” 
because she could make sounds that accompanied various illnesses in commercials. 
Later she was employed to make sounds that were more realistic than an actual 
recording, whether it was the sound of a can of Coca-Cola being opened and poured 
or the sounds of a jai alai match: all synthesized (R. Cohén 1977, 16). Ciani told me 
that for the Coca-Cola commercial: 

I went into the session, and they played the commercial, and they showed me the 
blank space. And I said, “All right, great, but you’ve got to tell me, is this going to 
be used only in this commercial?” “Well, what difference does that make?” “Well, 
because if its going to be pitched in this key of E, then I can design one thing, but 
if you’re going to use it in every key, in every rhythmic context, in every possible 
place, then I have to design it differently.” And they hadn’t even thought of that. 

They thought, “Well, here we are. We’ve got this open space. Just put something 
in there.” And I knew that there was a possibility that it would be used in other 
places. So I personally designed something that had no specific rhythm . . . that 
could fit in any place. The bubbles move chromatically, so they’ re not pitched in 
a key. And ... I used a lot of the white noise because that is not specific to any 
key. So ... I designed it to go in any place it could fit, and it did. They put it in 
every single commercial, all over the world. (Ciani 2004) 

She also said the folio wing: 

When I went in to do the Coca-Cola pop-and-pour sound, and this is typical of 
the desire to make a larger-than-life replica of a real sound like . . . the sound of 
the potato chip being crunched and the bottle opening. They would go and 
record the real sound, and of course it never sounded very good. And it didn’t 
have the punch, it didn’t have the life, it didn’t have any of the qualities that 
they needed. So after they exhausted themselves recording the real thing, 

I would come in and be like a surgeon. I would make sounds, my domain, 
one of my specialties was a very small domain. You know, if I made the sound 
of the potato chip crunching, I might have twenty elements in there: every . . . 
piece of salt flying . . . was something that I designed. And then I could splice 
together . . . the first few milliseconds, and, you know, get a blended sound. 

(Ciani 2004) 
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In these and other commercials, Ciani was developing what she called the 
“poetry of sound,” mentioned earlier: 

So it wasn’t so much the note music as much as it was a poetry of sound — you 
know, what is the sound of a fur coat? What is the sound of a key chain? What is 
the sound of perfume? And developing metaphors in sound. The feeling, you 
know, the feeling you got listening to it. Was it soft and warm? Was it hard and 
coid? You know, so this poetry of sound is what I really brought to the industry. 

(Pinch and Troceo 2002, 163) 

In 1985 Ciani and associates at her company Ciani/Musica talked about a com- 
mercial for General Electric called “Beep,” which illustrates her method of achiev- 
ing this poetry of sound. The company wanted a talking dishwasher. Ciani 
synthesized a sound on one of their synthesizers that synchronized with the rhythm 
of the words on the screen. Later, for other beep spots, they used a Synclavier, one 
of the first digital synthesizers, to sample the original beep and then sequence it: 

They presented the commercial with just a close-up of this dishwasher with a lot 
of blinking lights. And the actual dishwasher made a sound, but the sound was 
just one beep. And they wanted me to re-create the beep of the dishwasher and to 
intégrate it into this commercial. And I said, “Well, gosh, if it’s thirty seconds of 
just listening to this one beep — people will just go crazy. Can’t we do something 
with it?” “Well, I don’t know. Let’s check with the legal department because 
we’re not sure if it would be considered false advertising if the beep were not 
authentic.” And I said, “Well, while you’re checking, I’m going to put together 
what I think it should be.” And, of course, my Buchla synthesizer — one of the 
things that I loved about it was it had this feedback system of lights. So how you 
know what’s going on in the machine is that the lights tell you. If something is 
triggered, the light goes on. If the envelope is decaying, the light fades. If the 
random voltage is fast, the light oscillates quickly. If the sequencer stage is on, the 
light is up that stage, and so on and so forth. So I was very used to this dialogue 
with a language of lights. So I took this little beeping, the lights in the commer- 
cial, and I made the machine have a personality. And it was cute. You know, 
when the lights went up, it went up, and when the lights did this, it did that. 

And it was just really cute. (Ciani 2004). 28 

In an interview I had with her, she became very emphatic: She wasn’t simply adding 
emotion to a commercial: “I had to create a being. I had to create a personality, a 
character. I gave life to this thing. When they had this thing. it was just a machine 
with some lights on it. And by the time it was done, you wanted to hug it” (inter- 
view with Ciani 2004). Employing music to give personality to a commodity goes 
back to the earliest days of radio (see Taylor n.d.), but with electronic technologies 
employed to heighten the realism of what is depicted on-screen, a new mode of 
representation had been achieved. 
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VIDEO EXAMPLE 16.3 29 


CONCLUSIONS 


While Raymond Scott and Eric Siday struggled in various ways to convince poten- 
tial clients that their electronic sounds could be successfully utilized to sell goods, 
effectively domesticating them, Ciani, building on their success, produced sounds 
that seemed to be more realistic and vivid than the actual recorded sounds, a new 
phase in the domesticaron of electronic music. With the “poetry of sound,” the 
classically trained Ciani achieved what few in the world of advertising music had, a 
kind of rarefied aesthetic approach to this music. Using electronic music not 
as a substitute for conventional instruments, she broke firom the past uses of this 
technology — of anthropomorphizing, adding “personality,” or simply presenting 
electronic sounds as those of acoustic musical instruments — to participate in adver- 
tising’s common use of hyperreality to sell goods. 30 Coca-Cola sounds better than it 
does in real life because the sound you hear in the commercial is not simply a 
recording of the can being opened and poured but something that sounds better 
and, as a result, looks better. Ciani took domestication to the next level from her 
electronic music forebears: Not only did she make the product seem better than it 
was, but she also helped create an impression with sound that life could be better 
with a particular product. Beep. 


NOTES 


1 Eric Siday, “Electronic Music in Communication,” speech dated October 1 6, 1967, 
Eric Siday Archive, folder 215; emphases in original, New York Public Library, New York. 

2 For more on Scott as an inventor, see Winner and Chusid, n.d. 

3 Classical composers took up the science-fiction theme as well. In 1961 a reviewer in 
Saturday Review wrote approvingly of a composition by Swedish composer Karl Birger 
Blomdahl. His Aniara: An Epic of Space Flight in 2038 ad, released on Columbia, is 
described as telling the story: 

of a spaceship loaded with refugees from our now radioactive earth. They are 
headed for resettlement in a colony of coid tundra on Mars. Colliding with some 
asteroids, the ship loses its rudder and is forced to head uncontrollably and 
indefinitely into outermost space. At one point the earth, by now a far distant 
star, a blinding sun to the eyes, explodes, and the electronic music doubles the 
keen heartbreak both of the incident and feelings of the galactic-bound 
passengers. (Bowers 1961, 61) 
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4 This was originally presented as Raymond Scott, “Lecture on Electronic Music in 
Radio & Televisión,” Advertising Age convention, Chicago, July 31, 1962. 

5 For histories and useful discussions of the 1950S, see Halberstam 1993, Jezer 1982, and 
May 1989. For a history of the era that focuses on questions of consumption, see Cohén 
2003. 

6 These figures were calculated from the billing figures included in Fox 1997, 
appendix. 

7 See also McMahan 1962. For a good overview of Scott’s work, see Winner 2000. 

8 The problem of “lightness” persisted well into the 1980S; the associations of elec- 
tronic music with Science persisted quite a long time. A New York City advertising music 
producer said in the late 1980S that one of his spots worked because the music had a 
humorous edge to it. “The more contemporary you get with synthesizers, the less humor 
you get” (Rutter 1988, 15S). 

9 This was originally published as “Commercials Go off the Beaten Sound Track,” 
unidentified magazine article ffom December 1960 and reprinted. Around 1960, Scott was 
asked to adopt an “unusual approach” to a commercial for Vicks cough drops and tablets, 
and, for the first time, he employed his “Karloff ’ machine (liner notes to Manhattan 
Research Inc 2000, 119). 

10 See also “Tip Top Jingle Money Makers” (1962). 

11 Thanks are due to Raymond Knapp for this observation. 

12 This latter date comes from a letter from James P. Ellis to Philip L. Tomalin at 
Ogilvy, Benson, and Mather, Inc., September 28, 1960, Eric Siday Archive, folder 56. 
However, the copyright date for the composition is May 24, 1960 (and is found in the Eric 
Siday Archive, folder 5); the assignment of the copyright form is dated June 26, 1959, Eric 
Siday Archive, folder 55. 

13 Eric Siday, “For Phil Shabecoff,” Eric Siday Archive, folder 185. This document was 
prepared for a New York Times article (Shabecoff 1964). 

14 Eric Siday, draft article titled “Musical Identification in Contemporary 
Advertising,” in Eric Siday Archive, folder 223; first ellipsis in original. 

15 There is some confusión around the dates and génesis of this particular commer- 
cial. Despite many claims that it was electronic, its earliest versión seems to have been 
acoustic, with temple blocks, guitar, and bass; Herbert Deutsch has written that Siday 
wrote the commercial on the Ondes Martenot (Deutsch 1993, 15). Robert Moog said in an 
interview that he didn’t know how Siday made those sounds, but it wasn’t with any of his 
equipment (Robert Moog, interview by Trevor Pinch and Frank Troceo, Analogue Music 
Synthesizer Oral History Project 1996-1998, Coll. 640, Series 2: Transcripts box 5, folder 7, 
Moog, Robert, National Museum of American History, Washington, D.C.). An advertising 
trade magazine that included interview comments from Siday said that the commercial 
was an example of musique concréte , the sounds produced by “tempo [sic] blocks and the 
sophisticated editing of tape” (“Audio Logos: An Image in Seven Seconds — or Less,” 1964, 
36). It is not clear whether there were subsequent versions that were electronic. The most 
common versión, which circulates on the Internet (on YouTube and elsewhere), won a 
Clio Award in 1964 and is most likely the one that Siday describes as musique concréte. 

16 Identitones’ basic agreement form, Eric Siday Archive, folder 120. A copy of a letter 
from Edith Hall, ex-secretary to Austen Croom-Johnson, to the New York Department of 
Taxation and Finance, dated June 23, 1964, requests that Identitunes be dissolved since 
Croom-Johnson had died (Eric Siday Archive, folder 114). 

17 Eric Siday, “Electronic Music in Communication,” speech dated October 16, 1967, 
Eric Siday Archive, folder 215; emphasis in original. 
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18 According to a contract dated April 3, 1964, Eric Siday Archive, folder 38. 

19 These three quotations are from an untitled and undated document c. 1970, Eric 
Siday Archive, folder 146. 

20 Siday, “For Phil Shabecoff,” Eric Siday Archive, folder 185. 

21 Robert B. Jones Jr., letter to Eric Siday, November 4, 1965, Eric Siday Archive, folder 
126; emphasis in original. 

22 Charles Barclay, contact report, Eric Siday Archive, folder 139. 

23 John Dombek, letter to Eric Siday and Charles Barclay, January 24, 1968, Eric Siday 
Archive, folder 146. 

24 Walter Collins, letter to Richard Sonntag, August 21, 1972, Eric Siday Archive, 
folder 153; emphasis in original. 

25 Bernie Krause, luto a Wild Sanctuary: A Life in Music and Natural Sound (Berkeley: 
Heyday, 1998), 43. 

26 Bernie Krause, interview by Trevor Pinch, August 24, 1998, National Museum of 
American History, Analogue Music Synthesizer Oral History Project, 1996-1998, Series 2: 
Transcripts, box 5, folder 5, coll. 640. 

27 Advertisement for Edd Kalehoff, Backstage (April 20, 1984), 27; uppercase passage 
in original. 

28 For more on this beep, see Doerschuk 1985. 

29 There is a YouTube video of a production session with Ciani and associates at 
http://www.youtube.com/watch?v=sxUTtüue5RQ (accessed Feb. 15, 2011). 

30 For a useful discussion of hyperreality and music in the advertising world, see 
Kurpiers 2009. 
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ANDREAS FICKERS 


Introduction 


The aim of this chapter is to consider the radio receiver and, more specifically, the 
radio dial as a mediating interface. Usually the radio dial is thought of as mediating 
between the operator — the listener — and the radio stations tuned in by turning the 
dial. Here we look at mediation in a wider sense — between a European regulatory 
regime of frequency allocation and the imagined European broadcasting landscape 
of the listener. My argument thus develops a triangular relationship between the 
rise of a European regime of frequency regulation, the materiality of the radio set, 
and the symbolic appropriation of the European broadcasting landscape. This 
approach requires an analysis of the material, institutional, and symbolic dimen- 
sions of a concrete technical innovation: the calibrated radio station scale. In ana- 
lyzing the iconological and semantic meanings of this technical artifact, I emphasize 
the importance of material objects as sources for a cultural history of technology in 
general and of radio listening in particular. 

Accordingly, it is necessary to combine history of technology approaches with 
media and cultural studies perspectives. This cultural history approach to sound 
technologies allows me to embed the act of radio listening in a broader ensemble of 
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senses (haptic and visual) and thereby underline its effects on processes of what we 
might cali individual mental mapping: the cognitive production of an individual- 
izad representaron of experienced space (Downs and Stea 1977). The appropriation 
of the radio (this is my first thesis) involved the appropriation not only of a technol- 
ogy of communication but also of an imagined space: the ether. What I refer to as 
the “visualization of the hearing experience,” which resulted from the introduction 
of the station scale as a map of an imagined voyage through the ether, downgraded 
the previous practice of explorative listening in search of audible radio stations 
(tuning) while simultaneously prioritizing our “first sense” (seeing) as a new aid in 
radio set operation. This deeply changed the human/machine interaction with the 
receiver and provided the user — who turned from a puré listener to a viewer 
of a topographical representation of European station ñames (mainly ñames of 
cities) — with a new and highly symbolic media interface. Interacting with this 
new interface — as suggested by the iconography of advertisements and reported in 
many literary reminiscences to the station scale — innervated the user’s imagination 
of traveling the European broadcast landscape while turning the dial from one 
station to another. 

In this process of symbolic appropriation (this is my second thesis), changes in 
receiver design can be interpreted as a crucial shift in the interaction with the radio, 
as well as in its “material meaning” (Gitelman 2004). While early radio listening 
required technical skills and a specific tacit knowledge, the emancipation of radio 
from a médium of tinkerers or hobbyists to become a mass médium was bound to 
the development of a user-friendly design that transformed this electrical device 
into a domesticated piece of furniture. Although scholars have covered this domes- 
tication of the radio receiver to some extent (Aitken 1976; Douglas 1987; Hilmes 
1997; Lenk 1997; Méadel 1994), so far they have ignored the introduction of the 
radio dial as a crucial moment of transition in human-machine interaction. I argüe 
that the advent of the calibrated station scale facilitated the black-boxing of radio as 
a complex technology and created the “user illusion” of an easily manipulable 
machine. 

Finally, I develop a third thesis, which is that these crucial changes in the recep- 
tion side of radio, in terms of both receiver design and new forms of usages, carne 
as an unintended consequence of regulatory initiatives on the transmission side. 
Without the emergence of a European regulatory regime, based on the authority of 
the Technical Committee of the International Broadcasting Union, the develop- 
ment and successful implementation of calibrated station scales would not have 
been possible. 

To understand the substantial change in radio receiver usage, it is necessary 
first to briefly trace the early history of amateur radio technology and to describe 
the advent of this new médium as a sensorial revolution. Early literary responses 
to radio serve as sources for a cultural contextualization of the phenomenon of 
“hearing through the ether” and allow me to sketch the contours of how that new 
“art of listening” evolved from broadcast technology. Next I discuss the crucial 
steps in the technological development of radio in the 1920S by focusing on the 
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central role of radio amateurs and the tacit knowledge needed for the operation of 
these new electronic machines. The gradual reduction of tacit skills for radio 
operation involved a successful process of domestication, typical of the cultural 
appropriation of household technologies in the twentieth century (Silverstone 
2006). 

Subsequently I explore the gradual emergence of a European regulatory regime 
in frequency planning, mirroring the need for regulation of the fast-expanding 
broadcasting infrastructure of the mid-i920s. My argument concentrates on the 
role and importance of the Technical Committee (TC) of the International 
Broadcasting Union (IBU) in Brussels, which acted as a technopolitical mediator in 
the establishment of a European regime of frequency regulation. The successful 
promotion of a “technopolitics of accuracy” by the TC paved the way for the 
standardization of European frequencies and enabled — as an unintended conse- 
quence — the successful implementation of calibrated station scales as both technical 
and aesthetic innovations in radio receiver design in Europe. 

In a final section I interpret these changes in receiver design as material inscrip- 
tions of these regulatory efforts. A closer look at the design of the new sets 
with a calibrated station scale highlights the role of design as a mediating interface 
between technology and user, between the hidden structure of the regulated 
European broadcasting landscape and processes of mental mapping and symbolic 
appropriation of an imagined European topography of radio stations. 

The empirical evidence for my claims has been gathered by studying the 
Telefunken archives in Berlin and the IBU archives in Geneva, as well as a selection 
of major radio journals in Germany, France, Great Britain, and the Netherlands. 
The secondary literature used for contextualizing the phenomena discussed ineludes 
classical Germán accounts, as well as French, British, and American studies. 


The Advent of Radio as a 
Sensorial Revolution 


With the emergence of broadcasting, a new quality occurred in the process of the 
world’s spatial and communicative networking (Lenk 1997). From a macrohistori- 
cal perspective, one could describe this process as a change from a visible to 
an invisible networking of the world. While all kinds of traffic routes and cabling 
(electricity, telephone, telegraph) left visible traces in prívate and public spaces, 
wireless broadcasting heralded the age of the invisible but audible networking of 
the world. This process resulted in an increased sensibility for the sense of hearing 
as a producer of cultural meaning. 

Numerous literary sources report the amazement and fascination caused by 
the emergence of radio broadcasting. Writers, poets, columnists, and copywriters 
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celebrated the wonders of radio. Two of the central metaphors used to describe the 
new sensations were cc the world at home” and Cí the ear of the world” (Bleicher 
2000). In his masterful study Radio: An Art ofSound from 1936, the Germán film 
and radio critic Rudolf Arnheim wrote the following: “An apparatus which techni- 
cal peculiarity simply consists in enabling sounds made at a particular spot to be 
simultaneously reproduced in as many and as far removed places as one wishes 
by disrespectfully breaking through boundaries of das s and country, signifies a 
spiritual event of primary importance” (Arnheim 1972, 226). Radio complemented 
the growing possibilities of travel and transportation, and it also invited people to 
travel mentally. As René Schickele, a Germán regional writer, put it in a 1931 article 
titled Paneuropa der Sender> even the farmer in the Black Forest got connected to 
the world: “Today, you can be lonesome without losing the connection to the 
world” (Schickele 1984, 141). It is worth noting that authors often used “world” as a 
synonym for “city.” This equation of the modern world with urban life can be 
found in several radio poems of the 1920S and 1930S. Consider the following lines 
from a 1927 poem by Robert Seitz titled Überall her aus der Welt [From All over the 
World]: 

From all over the world I send you my salute 

From all over the world you were coming to me 

Alaska is on my side and China 

The city on the Seine and the town on the Newa 

Cities in decline and cities on the rise 

Rome and Sao Paulo 

I can hear your voices next to me 

And Fm going through you with your voice 

I walk on boulevards 

And pass the palaces in Granada 

I find myself amidst trafile in seaports 

And Fm all alone on the vast plains of a Southern continent. (Seitz 1984, 42-43) 

As shown by a 1927 essay by Lothar Band, called Hórbilder aus dem Leben [ Acoustic 
Images of Life], the simple announcement of a program from a distant city could 
trigger a feeling of nervous tensión in listeners: 

Some time ago, there was this short message full of weird tensión: “In one minute 
Geneva will be on the air!” Hastily transmitted, these nine words are trembling 
with an infectious excitement. These are words of uncertainty, for we do not 
know if the transmission across a distance of more than one thousand kilometers 
will work — words feeding the hope of becoming an ear witness of an important 
moment in world history . . . When attending such a newsworthy event via 
broadeasting, we not only want to listen, no, we want our ear to see. We want to 
particípate in this event, trying to raise it to the level of a real experience. (Band 
2002, 244) 

As these quotes suggest, it was not only the simple technical possibility of 
sending sounds through the ether that aroused people’s imagination. There was 
also a real emphasis on the revaluation of hearing and the ear as a producer of 
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cultural meaning. 1 In a 1925 árdele titled Die Denkumschaltung durch den Rundfunk 
[The Thought Switch Caused by Broadcasting], Christian Gerlich States the 
following: 

For thousands of years our thinking has been based on seeing. Then, suddenly, 
the electrical wave carne upon us like lightning, the unseen. Incidents in the ether 
that remained secretive and mysterious because we couldn’t see them suddenly 
became unraveled by our ears. Another of our five senses is taking over, and we 
have to switch from the eye to the ear. (Gerlich 2002, 210) 

A similar position can be found in an árdele on the “aesthetics of broadcasting” by 
the famous Germán novelist Arnold Zweig. Likewise, in a piece by Richard Kolb on 
the “horoscope of the radio play,” the microphone mutates into the listener’s ear, 
and the “announcer and listener meet at the shared focal point of mental acoustics” 
(Kolb 2001, 32). All of these authors share the same enthusiasm for the positive 
power of imagination caused by radio listening. With and through radio, so they 
claim, the ear has become a serious rival to the eye. a Hear thinking” has finally 
challenged the modern practice of “eye thinking”! This power of radio to foster 
cognitive visual and spatial imaging through the internal production of “acoustic 
images” is a crucial aspect of radio. As Susan Douglas puts it, “The magic comes 
from entering a world of sound and from using that sound to make your own 
visión, your own dream, your own world” (Douglas 2004, 28). 


From Tinkerer to Mass Médium: 
Functional and Symbolic Change 

of Radio Use 


When taking into account the real acoustic output that these early beeping and 
screeching apparatuses produced, it is quite surprising to see such terrific fantasies 
and expectations surrounding the emergence of the new médium. In addition to 
the mediocre sound quality of early broadcasting, the search for stations was a plod- 
ding affair that needed the full attention and concentration of the radio operator 
and the complete silence of the attending public. This often lengthy procedure was 
not always successful, and early critics of radio described the “surfing” on the radio 
waves in the ether — the long search for an acceptable signal in the wireless frequeney 
spectrum — as an abnormal phenomenon. The Germán philosopher of religión 
Johannes María Verweyen called this disease “radiotis” and lamented in 1930: 

Radiotis has caused a perturbation of the inner balance that is popularly called 
Drehkrankheit [tuning malady]. This forebodes to a certain pathological affection 
or even addiction to make as many turns as possible without paying any attention 
to the changing contents. (Verweyen 2002, 454) 
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Individual station adjustment (fine-tuning) was of central importance in the first 
decade of radio broadcasting, that is, until the emergence of calibrated station scales 
in the late 1920S. The fine-tuning adjustment called for a concentrated dedication of 
haptic and acoustic senses, or “fine motor skills” (Schmidt 1998, 316). It is impos- 
sible to underestimate the tacit knowledge needed to opérate a receiver. To explore 
the broadcast spectrum, many circuits had to be adj usted individually by rotating 
the control knobs sequentially in small increments until a station was heard. Next, 
several precise readjustments were necessary to bring in the desired program with 
clarity and volume (Harrison 1998, 296). The gradual development of automatic 
tuning devices, then, was an indispensable condition for the transformation of the 
radio set from a tinkerer’s or an amateures médium to a mass médium. Central 
steps in this process were the embedding of previously exposed components into a 
cabinet, the reduction of tuning knobs, and the integration of the loudspeaker into 
the cabinet. From 1924 onward, there were more than seventy patent applications 
for single-control tuning in the United States alone (Harrison 1998, 299-300). While 
the radio industry sought to produce sets for everyone-they believed that radio 
listening should be simple, and the technological visión of the time was a receiver 
that could be operated with only one knob. 

A 1930 advertisement by the British radio manufacturer Murphy, which read 
“We were not blessed with three hands,” shows that simplifying the operation of a 
receiver was a central aim of the receiver industry in its effort to win over more of 
the public, especially women, as potential buyers. 2 In a Telefunken note dated 1927 
one could read the following advice to the licensed dealer: “Please consider that 
most of your customers are no experts and that they cannot afford some sort of 
‘radio chauffeur’ to opérate their receiver.” 3 In a 1934 issue of the French journal 
Radio diffusion, one read this prediction: “The single-knob receiver will lead women 
to broadcasting, just as the electrical ignition led them to the automobile.” 4 In The 
Setmakers , British broadcast historians Keith Geddes and Gordon Bussey arrive at a 
similar conclusión: “With the introduction of the automatic volume control, the 
adoption of the superhet principie, and the demise of the oscillation-provoking 
reaction control, the taming of the domestic receiver was complete by the early 
1930S” (Geddes and Bussey 1991, 107). As Roger Silverstone and Leslie Haddon have 
argued on the basis of Adrián Forty’s study of the English radio cabinet design 
between 1928 and 1993 (Forty 1972), the simplified user interface of the radio set 
since the mid-i920s is a fitting example of what they cali “pre-domestication”: “an 
anticipation in design itself of the artifact’s likely place (in this case the home) and 
an attempt to offer a solution in the design of the object itself to the contradictions 
generated within the process of technical innovation” (Silverstone and Haddon 
1996 , 49)- 5 

The gradual simplification of the operation of the radio set since the mid-i920s 
can be interpreted as the first successful example in the still young history of elec- 
tronic consumer goods to enhance the user friendliness and create — through the 
intervention of engineering design and the exterior of the radio — a “user illusion.” 
This implied that the user mastered the device without in fact having any 
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understanding of its technical functioning. 6 The creation of this “user illusion” 
accompanied the development of a desktop philosophy by industrial designers who 
were confronted with the difficult task of creating functional simplicity in the face 
of structural complexity (Betts 2004). In this process of functional and symbolic 
transformation the apparatus became a piece of furniture whose inner mechanism 
was successfully hidden (Carlat 1998; Forty 1972). In the words of philosopher 
Norbert Bolz, the concealment of the technical interior of a machine — or, in other 
words, the black-boxing of technology by the development of user-friendly design — 
is “the rhetoric of the technology which consecrates our ignorance. And this design - 
specific rhetoric pro vides us with the user illusion of the world” (Bolz 1998). 

Next to the single-knob operation ideal, the radio industry endeavored to 
develop and intégrate the electrodynamic Hat speaker into the cabinet — a process 
that reminds one of the domestication of the phonograph some twenty-five years 
earlier. In an article titled “Die Wesensform des Lautsprechers: Ein Beitrag zur 
Ásthetik der Technik” [The Essence of the Loudspeaker: A Contribution to the 
Aesthetic of Technology], Ewald Popp, a Czechoslovakian loudspeaker specialist, 
describes the wonder of the flat speaker and refers to the importance of the loud- 
speaker as “contact person” for the listener: “We hear the loudspeaker, but we see 
him, too. Just as we look into the face of our contact person, we involuntary direct 
our glance to the loudspeaker. He is the acoustic center we look at — we have to look 
at [him] because our senses require it” (Popp 2002, 312). In other words, the eye 
“listens”! The integrated electrodynamic flat speaker and the emergence of the sta- 
tion scales (see later) mark the beginning of what I cali the return of Schaudenken , 
thinking by looking or, in other words, the increasing visualization of the hearing 
experience. 


The Emergence of a European Radio 

Infrastructure 


The emergence of a regulated broadcast spectrum in the mid-i92os served as the 
technical precondition for the development of calibrated station scales. Without a 
fixed frequency plan that allotted specific wavelengths as either exclusive or shared 
frequencies to specific stations, the calibration of receivers and the design of a radio 
dial with fixed station ñames on it would have been impossible. As I demónstrate 
later, however, this development actually carne about as an unintended conse- 
quence. The real motive for the establishment of a European regime for frequency 
regulation lay in dealing with the unexpected pace of growth of broadcasting as a 
new médium in the 1920S. Radical innovations in the tube industry, a mass of 
excited radio amateurs, and the discovery of the educational and entertainment 
potential of broadcasting caused a genuine radio boom in both Europe and the 
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United States. At the same time, the great success of radio broadcasting also created 
a serious threat to the new médium. The uncontrolled crusade of the airwaves soon 
became a severe concern, and critics like Rudolf Arnheim started to complain about 
the problems of interference and the rising “chaos” in the ether: 

What we hear today from the loudspeaker is an artistically forceful Symbol of 
constant war in peace, of the deficiency of central authority which we permit 
around us — a chaos concretized in discord and as such directly perceptible to the 
human ear. So the dance of music of one country comes through the funeral 
march of another. (Arnheim 1972, 238) 

To tackle this chaos, broadcasting became the subject of intense political and 
diplomatic activity at both the European and the international level. Such coopera- 
tion was indeed badly needed, as Arno Huth, author of the impressive survey of 
international broadcast activities titled La radiodiffusion: Puissance mondiale (1937) 
convincingly argued: 

Before the international cooperation there was chaos. Each country allotted 
frequencies following their national needs and without taking care of the 
transnational nature of broadcasting signáis. As a result, radio reception became 
an agony, and the listener suffered from constant interferences and turned into a 
victim of the peeping, cracking, crunching, and groaning sounds, of which he 
retains dreadful memories. (Huth 1937, 45) 

The first European initiative to tackle this problem of increasing interference 
in radio broadcasting was directed by Swiss radio amateur Maurice Rambert. In 
October 1922 Rambert had obtained the first license for radio broadcasting in 
Switzerland. As one of the early radio amateurs, he personally suffered from the 
growing disorder in the ether, and in 1923 he and his friend Dr. Edmond Privat, 
president of the Bureau International de EEspéranto in Geneva, initiated the first 
European meeting of broadcasters in Geneva (UIR 1944). This conférence prélimi- 
naire pour une entente Internationale en radio-téléphonie took place on April 22-23, 
1924, and was attended by some forty representatives of State administrations, 
private associations, radio clubs, the radio industry, and the wireless press. This 
preliminary conference resulted in a number of resolutions. Among other things 
the participants claimed an exclusive part of the frequency spectrum for radio 
broadcasting and for radio amateurs and also established a permanent organiza- 
tion for broadcasting regulation and cooperation in Geneva. Maurice Rambert, 
elected as chairman of the provisional executive committee, was charged with orga- 
nizing a second preparatory meeting at the British Broadcasting Company (BBC) 
in London (March 18-19, 1925), where agreement was reached over the general 
lines of the future organization. On April 3-4, 1923, the Union Internationale de 
Radiodiffusion/International Broadcasting Union (UIR/IBU) was finally launched. 
The BBC’s deputy managing director, Rear Admiral Charles Carpendale, became 
president of the First General Assembly, and his colleague Arthur R. Burrows, pro- 
gram director of the BBC, served as first director of the IBU’s permanent office in 
Geneva. 
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From the outset, Geneva was seen as the ideal headquarters because of its repu- 
tation as host to numerous international organizations, most prominently the 
League of Nations, the Organisation International de Travail (OIT), the Bureau 
International de l’Espéranto, the International Committee of the Red Cross — and, 
of course, the International Telecommunication Union (ITU). Symbolically, the 
foundation charter of the Union International de Radiodiffusion (UIR) was signed 
at the offices of the League of Nations building, a ceremonial performance that 
attested to the visión of broadcasting as an instrument of peaceful purposes 
(Tomlinson 1938). All of the new UIR members — despite what their diverse in ten - 
tions and interests may have been — were convinced of the exceptional importance 
of radio broadcasting as a political and cultural instrument. Despite the alarming 
tendencies toward the national or nationalistic deployment of the médium for ide- 
ological or propagandistic purposes, the central UIR actors were driven by a human - 
ist ideology and conceived of radio broadcasting as an unprecedented instrument 
for the promotion of international understanding and the transcendence of eco- 
nomic classes, political ideologies, and territorial boundaries. Exactly one year after 
the founding of the UIR, all of the members signed a “gentleman’s agreement” 
against the illegal and immoral use of radio as an instrument of propaganda. In the 
ensuing years, several bilateral and international conventions promoting the “moral 
disarmament” of the ether would be signed (Huth 1937, 386-91). 

The cióse link between the technical and the ideological or political self-image 
and mission of the early European initiatives to harmonize the European ether is 
telling. This may be confirmed by a consideration of the development of the IBU’s 
organizational structure. After the creation of a Technical Committee in December 
1925 to prepare a document to serve as a discussion base for a frequency plan, a 
permanent Juridical Committee was founded in March 1926. This was followed 
three months later by the establishment of the Committee of Intellectual, Artistic, 
and Social Rapprochement and, in July 1928, by the Committee of International 
Relays. 7 This organizational structure mirrors the intertwinement of the material 
(technical), institutional (juridical), and symbolic (intellectual, artistic, and social) 
dimensión of broadcasting as a critical infrastructure. 


Planning the Spectrum: The Emergence 

of a European Frequency 
Regulation Regime 


The first and most effective way to harmonize the European ether was to develop 
and agree on a frequency plan that allocated specific frequencies to broadcasting 
stations. The self-imposed task of the IBU was to organize the frequency spectrum 
reserved for radio broadcasting in such a way as to minimize parasitic interference 
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and to maximize order by allotting specific wavelengths to specific national broad- 
casting Services. In his study “Broadcasting without Frontiers,” published on behalf 
of UNESCO in 1959, George Codding argües that the “European experiment” of 
assigning individual frequencies to individual broadcasting stations was character- 
ized by three phases or approaches (Codding 1959). The first attempts that led to the 
frequency plans, known as Geneva (1926) and Brussels (1929), brought about 
voluntary cooperation between the various operators of broadcasting Services 
(prívate and public) under the auspices of the IBU. The second approach, inspired 
by Czechoslovakia, aimed at making frequency distribution more official by con- 
vening a special European conference of broadcasting representatives of national 
governments. This conference, held in Prague in 1929, produced the so-called 
Prague Plan. Third, the IBU itself convened two conferences of administration rep- 
resentatives (Lúceme in 1933 and Montreux in 1939) aimed at assigning European 
broadcasting frequencies in conformity with frequency allocation tables established 
by the International Telecommunication Union in Madrid and Cairo (Codding 
1952, 92). 

The various initiatives regarding regulation of the frequency spectrum for radio 
broadcasting were based on the principie of allocating each European country at 
least one exclusive frequency and several shared frequencies, proportionally allot- 
ted depending on the number of existing stations, the topography and size of the 
country, and its economic and cultural development. These factors resulted in a 
formula for calculating each country’s share of the spectrum. As Nina Wormbs has 
shown, the result — the number of allotted wavelengths per country — largely 
depended on the definition of Europe’s size: The more countries were taken into 
consideration, the fewer exclusive frequencies could be allotted (Wormbs 2008, 
116). This problem became evident during the negotiations for the Lúceme plan, 
when a large number of eastern European countries — including the Soviet Union — 
joined the IBU. The various plans were therefore the result of technopolitical diplo- 
macy: trying to combine the radio engineers’ rational grounds with the hidden 
or explicit political agendas of the national telecommunication authorities and 
broadcasting organizations. 

The establishment of the permanent Technical Center was an important 
step in institutionalizing the IBU as the “ether pólice.” The Technical Center 
became the headquarters of the Technical Committee and soon served as the IBU’s 


Table 1. The steady growth of both the number of stations and the transmitter 
power was a continuous challenge for the regulation of the European 
spectrum of broadcasting frequencies. Source: Codding (1952, 96) 



Geneva Plan 

Prague Plan 

Lúceme Plan 

Montreux Plan 

Year 

1926 

1929 

1933 

1939 

Number of stations 

123 

200 

257 

35 i 

Total power in kW 

116 

420 

3.260 

10.790 
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technoscientific conscience. In the eyes of the Technical Committee, the main 
remedy for the chaotic situation in Europe was the accurate control of the broad- 
cast stations’ transmitters. This was made possible by a newly designed frequency 
meter, calibrated to the frequency allotted to each station by the Geneva plan. One 
year after the creation of the IBU’s permanent Technical Center in Brussels in 1926, 
more than fifty frequency meters had been ordered by various broadcasting organi- 
zations. The frequency meter, which became the materialization of a regulatory 
regime, was successfully implemented by the IBU’s Technical Committee. Its 
politics of accuracy helped to significantly reduce the interference problem and 
established an effective control mechanism. The frequency meter as the material- 
ization of a technology of transmission control found its material equivalent in the 
calibrated station scale on the reception side. According to Sheila Jasanoff and 
Brian Wynne, the emergence and implementation of the frequency meter and the 
calibrated station scale are examples of the coproduction of a regulatory regime 
and technical artifacts. Broadcasting as a new “social project” had shored up the 
legitimacy of both the newly produced natural knowledge (and its material mani- 
festations) and the political order of the regulatory regime (Jasanoff and Wynne 
1998, 16). 

However, another — and in the eyes of the Technical Committee more prob- 
lematic — difficulty was that even IBU member organizations that had ratified the 
plans often ignored the agreements or failed to apply them accurately. This tensión 
between the de jure regulation and de facto violation of the European broadcast 
space demonstrates the critical status of the IBU as a regulatory body. As a nongov- 
ernmental organization, the IBU had no real political or juridical power to press 
its members to enforce the agreement, let alone to exert any kind of pressure on 
nonmember organizations (figure 17.1). 

A positive development, however, was the increasing adjustment of the trans- 
mission characteristics at a distance by means of telegraph correspondence (and 
later by telephone). When the staff of the Technical Center in Brussels detected 
serious distortions of or variations in the power and/or frequency of the transmit- 
ted signáis, they immediately contacted the station’s technicians (either by telegram 
or a phone cali). From 1926 on, this very practical and effective form of réglage a 
distance between the Technical Center and broadcast stations all over Europe 
became one of the main concerns of the technical staff in Brussels, reinforcing the 
IBU’s status as the technical authority in radio broadcasting regulation. The techni- 
cal advances in and the accuracy of frequency measurement in Brussels helped to 
establish a regime of soft regulation based on the expertise and political neutrality 
of the Technical Center. This was clearly the credo of Raymond Braillard, chairman 
of the TC and Director of the Technical Center in Brussels, who believed that the 
constant progress of Science would help to surmount transitory difficulties if it 
were accompanied by strict technical application. 8 While this belief in techno- 
scientific rationality and its inevitable practical success was widespread in the inter- 
national community of radio engineers and broadcast technicians, it was constantly 
challenged by “irrational” interference of a political or an ideological nature. 
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From Technical Instruction to 

SCIENTIFIC UNDERSTANDING 


From a long-term perspective, one is tempted to see in the different plans just tem- 
porary cures of symptoms, not of the actual European frequency disorder itself. 
The regulatory initiatives were constantly challenged by both technical and political 
changes. At the technical level, the gradual expansión of the frequency bandwidth 
used for radio broadcasting (long, middle, and, in the early 1930S, short waves) and 
the continuing expansión of transmitter power reduced the half-life of the plans to 
less than a year. Moreover, the ongoing diplomatic and political struggles on the 
territorial definition of the European broadcasting space foiled the TC’s techno- 
cratic hopes that the frequency allocation could be managed by the application of a 
scientific formula. Yet from a critical historical perspective, it would not be fair to 
interpret or even judge the “success” or “failure” of a European regulatory regime 
only on the basis of the stability or life cycle of the different frequency plans 
(figure 17.2). 

When looking at the activities and Ímpetus of the Technical Committee 
specifically and of the IBU in general, one can identify a clear focus on the study 
and application of precise technical parameters for a smooth European and inter- 
national operation of radio broadcasting. The regulation of the European ether 



Figure 17.2. Amelioration of the frequency stability of European broadcasting sta- 
tions (exclusive frequencies) between 1929 and 1932. Source: “MEMORANDUM sur 
la situation actuelle de la radiodiffusion européenne au point de vue des interferences 
et les moyens propres a y remédier.” Bruxelles: Publications techniques de PUnion 

Internationale de Radiodiffusion No. 1, 1932, p. 26. 
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COMPARAIS ON DE LA 5TABILITE DE LA EREQUENCE DE5 5TATI0NS DE RADIODIFUSION 

AUX ETAT5 UNIS ET EN EUROPE 


ETAT5 UNIS 1349 Stations) 



ECART MOYEN 
PAR RAPPORT A LA 
FREQUF.NCE NOMINALE 


EUROPE (135 Stations) 



COMMISSION TECHNIOUE ET CENTRE DE CONTROLE ot L.U.I.N 


u.r.R. 2 


Figure 17.3. Comparison of frequency stabilization USA / Europe. The white area 
indicates the numbers of stations working with a very high degree of frequency 
stability (less than 0.1 % of deviation from the calibrated norm). 

Source: IBU Archives, Geneva 1932. 


seemed primarily a problem of adequate description and measurement of the tech- 
nical criteria of radio broadcasting (frequency meters and study of origins of inter- 
ference) to be folio wed by a rigorous application and enforcement of these rules. As 
figure 17.3 shows, this politics ofaccuracy — prominen tly propagated by the Technical 
Committee and skillfully customized by the Technical Center in Brussels — has been 
extremely successful and undeniably produced tremendous positive effects (reduc- 
tion of interference). As comparison to the situation in the United States shows, 
this politics of accuracy may be interpreted as a real European technopolitical 
success story. 9 

The negotiations during the development of the Lucerne plan demónstrate 
that the technical norms and standards developed by the Technical Committee 
had been accepted and adopted by all of the negotiating parties regardless of the 
governmental, prívate, or corporate status of the IBU member organizations. 


VlSUALIZATION OF THE LlSTENING ACTI 

The Invention of the Station Scale 


Not only did the emergence of a real European transmission network cali for rigor- 
ous regulation of the frequencies utilized by the radio stations; it also represented a 
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technical challenge for the receiver industry. The only way to guarantee reasonably 
good reception in the densely populated ether was to impro ve receiver selectivity. A 
real milestone here was the development of the so-called superheterodyne circuit, 
which was based on the physical principie of mixing, or heterodyning, a received 
signal to a fixed intermedíate frequency that can be more conveniently processed 
than the original radio carrier frequency (Witts 1935). In many countries, these 
receivers were called “superhets,” or simply “super.” Because of their excellent 
selectivity, they were the first sets that allowed the installation of calibrated station 
scales. From 1931 on, leading set manufacturers promoted their automatic adjust- 
ment circuits, which offered the “highest selectivity” and “perfect single-knob oper- 
ability.” The Schwarzwálder Apparate-Bau-Anstalt (SABA) dubbed its 1931 model, 
SABA 41 W, “the signpost to the frequency jungle!” (Ketterer 2003, 90). The first 
superhet with a calibrated station scale on the Germán market was the Mikrohet W, 
manufactured by Stassfurt in 1928. This model had such success on the market that 
other producers immediately copied the idea (figure 17.4). 10 

From 1933 on, the station scale became a standard function in mid-price-range 
receivers. The mechanical realization of these station scales was hardly a simple 
matter, however. Because of many neighboring stations in the narrow frequency 
bands, a complicated mechanical construction was necessary to produce the optical 
illusion of a scale with evenly spread stations. The conversión of a turning move- 
ment on the station selection knob into an automatic fine-tuning of the electrical 
circuits was a real masterpiece of electromechanical achievement. The optical solu- 
tion of this problem reminds one of the invention of nonscaled maps for the London 
underground developed by electrical engineer Henry Beck in 1933 (Schlógel 2003, 
102). The new underground map owed its success to its simplicity and clearness, 
constructing a symbolic nearness or connectivity of the suburbs with the city center. 
In other words, Beck’s map brought the suburbs closer to the center. Instead of a 
realistic representation of the daunting bigness of greater London, it converted the 
real geographical distance into an invitation to the city (Vertesi 2008). Similarly, the 
radio station scale converted the fascinating but unfathomable “ether” into mean- 
ingful places (see figure 17.5). 

The development and introduction of the calibrated station scale marked an 
important innovation in the technical improvement of the receiver, as well as in 
radio design. The technical solution of the “scaling problem” consisted in a com- 
plex counterbalancing coupling of the visual station indicator with the rotary 
capacitor responsible for selectivity (Rhein 1935, 97-143). Around the mid-i920s, 
sets that used optical elements for the indication or recovery of stations sporadically 
appeared, for example, numerically gauged adjusting knobs or so-called microscales, 
little panels that showed a small cutout of the scale (Friemert 1996, 47-49). 
These rapid innovations in the radio receiver industry — the pace was set by the 
seasonal rhythm of radio trade fairs (Riedel 1994; Bressler 2009) — sparked harsh 
technical and Creative competition among the set manufacturers. In 1930 one of the 
major players in the European radio industry, the Germán Telefunken Society, 
acquired the patent and licensing rights to an invention by French inventor 
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Figure 17.4. Advertisement for the radio set “IMPERIAL júnior” from the Germán 
firm Stassfurt, promoting the set as “the receiver with the highest selectivity 
in the world,” “real one-knob operability,” and “calibrated station scale with 80 
European stations.” Source: Ketterer, Ralf. Funken. Wellen. Radio. Zur Einführutig 
eines technischen Konsumartikels durch die deutsche Rundfunkindustrie 1923-1939. 

Berlin: VISTAS Verlag, 2003, P- 79- 


Joseph-Louis Routin, who had realized a skillful solution to the “scaling problem” 
and distributed it under the label of “Valundia system” (figure 17.6). 

Telefunken offered Routin a consultancy contract, arranging for a yearly visit 
to the company’s laboratories in Berlin. The patent and licensing contract assigned 
Telefunken the rights to produce and to commercialize three of Routin’s inven- 
tions: the conical scale called Nomina, the fine-tuning adjustment for capacitors, 
and a plug connector for an illuminated scale. 11 From 1931 on, Routin’s Valundia 
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Figure 17.5. “No searching - just dial.” This was the slogan of the NORA ad from 1931. 
Source: Telefunken Archives, Deutsches Technikmuseum, Berlin. 
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Figure 17.6. Drawing of the Valundia-System in the patent application of Joseph-Louis 
Routin (1930). Source: Telefunken- Archives, Deutsches Technikmuseum, Berlin. 


system was commercialized by Telefunken under the ñame of Auto-Skala 
(figure 17.7). 

The station scale is advertised as a signpost to or an atlas of the most important 
European stations. Unlike most other models on the market with fixed station 
scales (mainly in the form of printed scales placed on the front of the receiver), the 
Auto-Skala provided the listener with an opportunity to react in a flexible manner 
to the common changes of frequencies. In other words, the Auto-Skala was an 
interactive device that enabled users to customize the scale to reflect their listening 
habits. The booklet that carne with the Auto-Skala stated the following: 

The basic newness of the Auto-Skala consists in the exchangeable station 
ñames. Henee: no dead figures, no tuning tables, no searching but easy reading 
of the station ñames. No schematical listing of all irrelevant stations in Europe 
and no obsolescence because of frequeney changes, but comfortable adoption of 
your favorite stations and flexible adjustment of the terms . . . for all station 
ñames are written on changeable labels easy to replace when a station changes 
its frequeney . 12 

The text of this advertisement draws our attention to a dimensión too often 
neglected in the history of technology: the user (Oudshoorn and Pinch 2005). 13 
Analysis of the discursive strategies detectable in both the iconographic and the 
semantic dimensión of various advertisements promoting the new station scale 
gives rise to several observations. The first is that most of these texts catered to a 
female public. As argued earlier in this chapter, the introduction and promotion of 
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DIE AUTO-SKALA 



DIESES BLATT SOLL IHNEN HELFEN 

die wichtigsten Sender auf Ihrer Auto-Skaio erstmalig zu 
bestimmen. Haben Sie einen der auf dem Bild eingezelchneten 
Sender eingesfellt, so dient das Bild ais Wegweiser, an welcher 
Stelie ungefahr die wichtigsten europáischen Sender herbar 
werden. Die Sender folgen, wie Sie $ehen,in der gleichen Reihen- 
folge aufeinander wie auf dem beigegebenen Senderverzeithnis, 
nur daft das Senderyerzeichms auch die dazwischen gelegenen 
Statíonen enthált. Sie brauchen dieses Hilfsmittel nu^bís Sie die 
Sendernamen erstmalig aufgesetzt haben; dann ermoglicht 
Ihnen die Auto-Skala dasWiederauffmden ohne ¡edes Hilfsmittel. 

Figure 17.7. The Auto-Skala of Telefunken (1931). 

Source: Telefunken Archives, Deutsches Technikmuseum, Berlin. 
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the station scale must be embedded in a process of the “domesticaron” of the radio 
receiver. In this process, described as a process of functional and symbolic appro- 
priation of the apparatus, the improvement of the set’s user-friendliness can be seen 
as the radio industry’s central motivation since the mid-i920s. This shift in the 
identity of radio as médium (from tinkerer’s médium to mass médium) was closely 
linked with the attraction of a massive new user potential represented by children 
and, in particular, women. For instance, the front page of the booklet advertising 
the four 1931 models introduced with the new Auto-Skala for the Germán market is 
decorated with an elegant Chínese statue: A woman who radiates self-confidence, 
Eastern grace, and a hint of exoticism looks down on the receiver. On the following 
page, a photograph shows a bourgeois living room with a chess set in front of the 
receiver; its caption reads: “In every elegant living room, the Telefunken receiver 
with Auto-Skala is an organic part of the furnishing. Its artistic shape perfectly 
harmonizes with every style.” The portrait of a hand easily changing a label with a 
station ñame is clearly a female hand. A Dutch Telefunken advertisement of the 
same year promotes the company’s receivers with the slogan “In your own environ- 
ment” and emphasizes the fact that the station scale directly connects the listener 
with all European stations. 14 Despite this undeniable attention to the female listener 
in advertisement strategies, the construction of gender is not limited to women. In 
fact, many radio advertisements in this period demónstrate a well-balanced mix- 
ture of female and male elements. While aesthetic and functional adjectives 
addressed women as the users of the appliance and as competent managers of the 
household, technical information about the number of circuits, tubes, and the 
material basis of the chassis were meant to appeal to the juvenile or male radio 
connoisseur. 15 

While advertisements as a specific category of sources are perfectly suitable 
for a discursive analysis of gender construction (Smulyan 1994), the analysis 
of the appliance as material source offers additional perspectives. Here, the role of 
design as a mediating interface between technology and the user becomes 
crucial. As the British broadcast historian Paddy Scannell (2004, 86) argües, 
“the mediating stage in the transition from technology to domestic equipment is 

design Design is essential to the transformation of user-unfriendly technologies 

that only trained experts can use into simple user-friendly things that anyone 
can use.” 

As I argüe elsewhere (Fickers 2007), we can identify at least five dimensions or 
functions of design when looking at the radio set. First, design comes with a “gender 
script.” As Madeleine Akrich, Nelly Oudshoorn, Trevor Pinch, Ellen van Oost, and 
others have argued, technical artifacts can be analyzed as representations of gender 
constructions inscribed into the materiality of the technical object (Oudshoorn and 
Pinch 2005). Parallel to the concept of “gender script,” Steven Woolgar’s concept 
of the design process as a struggle to configure the user offers an alternative inter- 
pretaron of the role of design. For Woolgar (1991), configuring is the process of 
identifying putative users. The writings of Joseph-Louis Routin are a striking exam- 
ple of how inventors and engineers have configured the potential users of their 
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inventions. In a letter dated January 27, 1931, to Telefunken patent lawyer Levy, 
Routin explained: 

The radio dial with its cluster of ñames of stations is especially appreciated by 
women and children, who have an instinctive horror of numbers. It is useless to 
speak of lambdas, wavebands, frequencies, or kilohertz to them. But it is exactly 
this clientele which is of great interest to the radio industry as it represents the 
largest segment of the population and is most interested in domestic family 
distractions. 16 

Second, another function of design is that of being an agent of standardization 
of mass production and mass consumption. The radio as “Fordist lead product” 
(Wittke 1996) played an important role in the aim of industrial designers to “cultí- 
vate” and “aestheticize” mass products in the capitalist economy (Kónig 2003). In 
this sense, a third function of design is of course to medíate between function and 
form. As philosopher Ernst Cassirer wrote in his Form und Technik in 1930, all intel- 
ectual examination of the world is based on a double process of Fassen : grasping in 
the sense of theoretical comprehension and grasping in the sense of physical shap- 
ing (Cassirer 1995, 52). This double process of grasping is exactly what happens in 
the present case: the connection between the symbolic appropriation of the world 
(reading of the station scale) and the physical grasping of the material world 
(manipulation of the radio set). Next, and closely related to this, there is the mean- 
ing of design as symbol or language. Philosophers such as Roland Barthes and Jean 
Baudrillard have emphasized the semantic quality of the objet signe , and “product 
semantics” is nowadays an established field in design theory. 17 Finally, the fifth 
function of design is the already mentioned function of concealing technology in 
order to create the “user illusion.” As demonstrated earlier, all of these dimensions 
or functions of design are evident when looking at the radio set as a mediating 
interface. 


Conclusión: On Station Scales, Mental 
Maps, and Imagined Topographies 


One of my theses in the introduction was that the ñames of cities on station scales 
can evoke something like a mental map in the imagination of the radio listener. In 
Mapping Cyberspace , Martin Dodge and Rob Kitchin assert that the geographical 
space is superposed by an imagined space that allows people and organizations to 
react in a flexible way to the changes in real spatial geographies (2001, 14). Since the 
publication of Maps in Mind: Reflections on Cognitive Mapping by geographer Roger 
Downs and psychologist David Stea in 1977, the concept of “mental maps” has been 
widely discussed (Schenk 2000). According to Downs and Stea, a mental map is a 
structured image of a part of a person’s spatial environment. This map does not 
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Figure 17.8. Station scale of a French receiver showing in three wavelengths: 
short wave (16-50 meter, left), médium wave (200-550 meter, middle) and 
long wave (1200-2000 meter, right). Source: Photo by Dan Talson © 


have to be correct in a geographical sense. It is mirroring the world as the person 
envisions the world. As Janet Vertesi argües, the iconic image of the London tube 
map has been incorporated into users’ cognitive mapping of London despite the 
obvious divergence between the real topography of the city and its idealized repre- 
sentation on the map. Subscribing to Vertesi ’s plea not to sepárate the analysis of 
representations of technological Systems from a community’s interaction with them 
(Vertesi 2008, 26), I argüe that the interaction with the radio station scale can be 
interpreted as an active process of mental mapping on the part of the users. 

The dial can be read as an invitation to an imagined ether voyage, where 
London, París, Oslo, or Hilversum were just one turn away from each other. They 
evoked in the radio listener — who is a radio watcher, too — a mental map whose 
fault line could be decoded only by the listener. Station scales, then, served as early 
atlases of globalization, and one may interpret the use of radio in the 1920S and 
1930S as a symbolic appropriation of the European broadcasting landscape. In the 
words of Paddy Scannell, “It is not just that radio and televisión compress time and 
space. They create new possibilities of being: of being in two places at once or two 
times at once” (Scannell 1996, 91). In his inspiring essay “Ásthetik der drahtlosen 
Telegrafíe” [Aesthetic of Wireless Telegraphy], design historian Chup Friemert 
reinforces this thesis and links it to the functionality of the radio station scale: 
“The lust for the scale is the lust for the transportability of world events, as well as 
the lust for their disposability” (Friemert 1996, 49). 
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As Uta Schmidt has shown in her study on the appropriation of radio in the 
Weimar Republic and the Third Reich, the sense of seeing occupied a central posi- 
tion in the “economy of pleasure” (Norbert Elias). Listening to the radio — 
especially when considering the effect of illuminated station scales and “magic eyes” 
in the dark — evoked feelings of intimacy and aesthetic delight: 

The illuminated station scale has deeply engraved itself into the memory of many 
generations of radio listeners. Although many of the stations could be captured 
only with considerable effort on a good antenna, the stations promised a world 
outside the possibilities of bodily experiences and invited the listener to make 
fantastic imaginary travels. (Schmidt 1998, 317) 

In that way, the radio dial became a virtual roadmap for the journey through 
the ether, the radio station mutated into a station, inviting the listener to remain 
there for a while. Many ego documents (e.g., novéis, poems, song lyrics) describe 
the aesthetic fascination, juvenile excitement, and the stimulation of imaginary 
thought travels experienced while interacting with the radio dial with all of one’s 
senses. 18 In the Dutch book for young people De radio -detective: draadloze ogen 
[The Radio Detective: Wireless Eyes] by Leonard Roggeveen (1930), Hans, the 
book’s fifteen-year-old protagonist, experiences the “wonder” of traveling through 
Europe by simply turning the dial of his self-constructed receiver: 

Hilversum 2 was precisely in the right place. The red calibration indicator stood 

right in the middle of the 298-meter wavelength panel He moved along the 

whole dial, and most of the stations carne in very clearly. Luxemburg and Rijssel 
played dance music, a man in Munich said, “Achtung, meine Damen und Herren.” 

A symphony orchestra was performing in Strasbourg, “here national program,” 
said a lady in Droitwitch. . . . Brussels followed, 1 and 2, and London, Paris, 

Rome — yes, Rome! Hilversum 1, Vienna, Stuttgart, Beromünster! Hans listened to 
an opera in Paris, to gramophone music from Rome, lovely violin music fr om 
Stuttgart. All Europe was present: He traveled in one turn on the dial from 
Hilversum to Vienna, ffom Rome to London; the countries lost their borders. The 
most alien peoples stood next to him — next to him in his little chamber. He heard 
voices from strangers hundreds of kilometers away as if they were standing next to 
him, and he immediately understood the meaning of what he had read in so many 
radio advertisements: “Make the world your neighbor.” (Roggeveen 1930, 24) 

The new broadcasting space was not only shaped by the technological infrastruc- 
ture of sender networks around the world; it was also actively co-constructed by 
radio listeners, who became individual actors in the spatialization of their world 
(Geppert, Jensen, and Weinhold 2003). While turning the dial and tuning in a 
station (city ñame), the vast broadcast space was transformed into a specific, mean- 
ingful place. In this sense, we should interpret the acts of radio listening, viewing, 
and tuning as acts of symbolic appropriation of the world. To stylize the radio as the 
“monosensual stepchild of the century of audiovisual attractions” (Maase 2004, 47) 
misjudges both the aesthetic attraction and playful usages of wireless technology in 
a technical culture (Haring 2007) and the multisensorial experience of radio as a 
cultural practice. 
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NOTES 


1 On the “supremacy” of the visual in Western culture, see Jenks (1995). For a 
critique of this one-dimensional concentration on the visual, see Mitchell (2005). 

For a sophisticated analysis of the relation between seeing and hearing in the process of 
modernization, see Sterne (2003). 

2 “We were not blessed with three hands!” Advertisement by Murphy in The Wireless 
and Gramophone Trader (Aug. 30, 1930), 26. 

3 Telefunken-Sprecher, no. 7, 1927, quoted in Ketterer ( Funken . Wellen. Radio [2003], 
167). On the disappearance of real chauffeurs in the history of the automobilist see the 
chapter by Stefan Krebs in this volume. 

4 Quoted from Méadel (1994), 196. 

5 Because of the “double articulation” of the radio as both object and médium — 

a key distinctiveness of media technologies compared to other technologies — Fíaddon and 
Silverstone speak of successful domestication only when a communication technology has 
been tamed as an object and socially appropriated as a médium. “With both dimensions of 
doubling in mind it becomes both possible, indeed essential, to see the innovation of 
media and information and communication technologies as a fundamentally social 
process, and the particular complexity of the design/domestication interface of these 
technologies as one which requires an understanding of the role of both producer and user 
in its definition.” (Silverstone and Haddon 1996, 50). 

6 Significantly, the notion “user illusion” was coined by the American Computer 
pioneer Alan Kay, who first worked at Xerox PARC and later became one of the founders 
of Apple; he was to be instrumental in the development of human/machine interfaces. For 
a detailed history of the invention and development of the graphic user interface (desktop 
metaphor), see Friedewald (1999) and Bardini (2000). A more than inspiring introduction 
to the philosophical and epistemological dimensions of the “user illusion” and “desktop 
philosophy” is offered by the Danish Science journalist Tor Norretranders in The User 
Illusion (1998). 

7 The creation of the different committees naturally affected the working methods of 
the whole organization. In fact, each question or problem to be discussed by the IBU was 
first submitted to the General Council, which subsequently relegated it to one of the 
committees. Next, they had to charge one of their members to produce a report, which 
was then examined by the General Council or — in some cases — the General Assembly. The 
frequent meetings of the different working committees circulated between countries, while 
the General Assembly met in Ouchy-Lausanne in Switzerland on a yearly basis (see UIR 
1944, 16). 

8 Raymond Braillard, “Rapport de la réunion de la Commission Technique á 
Lausanne,” 31.05.1928, p. 15. In IBU/EBU archives, Geneva, box 8oter / Com. T. au Conseil. 

9 A variety of factors, such as the different topographic conditions in the United States 
and Europe, the emergence of the commercial network system in the United States versus 
the predominance of the public Service broadcasting in Europe, and the great diversity of 
languages and dialects in Europe in comparison to the United States, make the European 
situation rather different and complicated when compared to developments in the United 
States. This complexity clearly called for a more precise and accurate technical regime. On 
the United States, see Slotten (2000). 

10 For a richly illustrated encyclopedia of early Germán radio sets together with short 
portraits of radio manufacturers, see the five volumes of Abele (1999). 
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11 See “Aktennotiz über de Besprechung mit Herrn Routin am 10. Dezember 1930,” 
Deutsches Technikmuseum Berlín (DTM), Historisches Archiv, I.2.060 C, no. 1750. 

12 See the booklet of the Telefunken receiver with Auto-Skala in DTM, Historisches 
Archiv, III Sog. 2 Firmenschriften, no. 51641. 

13 For a useful sociological overview of the relationship between technology, usage, 
and representation see Flichy (2008). 

14 “In uw eigen milieu,” Telefunken advertisement in Radio Gids, Officeel Orgaan van 
de Vereniging van Arbeiders-Radio-Amateurs 5(13) (Jan. 24, 1931). 

15 In her masterful study of the cultural and social appropriation of the radio set in 
Weimar and Nazi Germany, Inge Marfiolek refers to a 1933 Telefunken ad for the Viking 
model to demónstrate the multifaceted character of radio ads. This Viking ad, printed as 
double-page leaflet, had a strict gender-related layout: The left side was headed “Wie sie 
ihn sieht” [How she sees it], while the right side had the headline “Wie er ihn sieht” [How 
he sees it]. While the left (female) side was visually dominated by a picture of a little boy, 
smilingly turning the knob to adjust the station scale and thereby demonstrating the 
simple use of the set, the right (male) side showed a photo of the interior of the set, 
representing its “organic and stable” assembly. The ñame of the model itself was a 
tribute to the new National Socialist power in Germany: “Viking. A proud ñame from 
the time when Germanic sailors accomplished great things in small boats.” See Riedel 
(1994: 311-12). 

16 See letter from Joseph-Louis Routin to Mr. Levy, París, Jan. 27, 1931, Annexe 1, p. 1, 
in DTM, signature I. 2. 060 C, 1750. 

17 “Probably the most noteworthy development of design today is its concern for the 
cognitive meanings, symbolic functions, and cultural histories of form. . . . Product seman- 
tics is the study of the symbolic qualities of man-made forms in the context of their use 
and the application of this knowledge to industrial design. It takes into account not only 
the physical and physiological functions but the psychological, social, and cultural context, 
which we cali the symbolic environment. Product semantics is an effort to understand and 
to take full responsibility for the symbolic environment into which industrial products are 
placed and where they should function by virtue of their own communicative qualities.” 
(Krippendorf and Ritter 1984, 4) 

18 At present I am collecting ego documents in order to underpin my arguments with 
more historical relevance on the side of the users. So far I have collected more than 150 
references to the radio dial in Germán, French, and English novéis and children’s books, 
some 25 references in poems, and 15 references in song lyrics. In addition to that, many 
advertisements of the 1930S use the metaphor of the radio as an “ear to the world” as a 
parable for modernity and present the radio dial as a virtual atlas or road map of 
Europe and/or the world. A thorough narrative analysis of these ego documents and an 
iconographic analysis of the advertisements based on the concepts of “parables” and 
“visual clichés” as introduced by Roland Marchand (1985) will be the topic of a future 
publication. 
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CHAPTER l8 


FROM LISTENING 
TO DISTRIBUTION: 

NONOFFICIAL 
MUSIC PRACTICES 
IN HUNGARY AND 
CZECHOSLOVAKIA 
FROM THE 1960S TO 

THE 1980S 


TREVER HAGEN WITH TIA DENORA 


Introduction 


This chapter presents an empirical case study of the wide variety of nonofficial 
settings and reinventions of music listening, recording, and distribution technology 
in Hungary and Czechoslovakia from the 1960S to the 1980S. 1 As much as possible 
we compare the two sites, but our main purpose is to use the data to discover how 
individuáis used sounds and to consider the question of what those sounds, coupled 
with their uses, enabled these individuáis to do. We also use the case study to think 
more abstractly about sound and music as a resource for collective agency and 
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action. Here we are interested in how collective agency takes shape in relation to 
what we describe as “creative constriction”: the paradoxical situation whereby 
suppression and control generate new opportunities for Creative action. We 
sitúate this Creative constriction in terms of how its specific sound technologies — 
their construction, consumption, and appropriation — afforded alternative and 
socially important cultural practices that were informally learned via sonic/musical 
experience. These “lessons” in turn provided a springboard for nonofficial 2 modes 
of being that coalesced at individual and collective levels. 

Forms of nonofficial listening, recording, and distribution practices in 
both countries during the communist era examined here involve attention to 
radio broadcasts, listening to and trading LPs from the West, 3 and producing 
magnitizdat — self-made recordings. Musical experience, then, is understood as the 
intersection of sounds, music, technologies, and places. Music, in this understand- 
ing, is a flexible médium — a liminal space — one in which all the fine shades 
of an actor’s lifeword can be displayed. This display, we suggest, permitted 
music listeners to pursue — to varying degrees — alternative or independent ways of 
being and feeling, from dipping a toe in nonofficial waters to plunging in and never 
resurfacing. We use the term official to describe areas of society and culture that 
were defined and ordered by centralized powers of the State, manifested not only in 
institutions and agencies but also in everyday practice, as Jakubowicz puts it, “to 
achieve a commonality of enthusiastic commitment to building communism” 
(Jakubowicz 1994, 271). 

Central to our argument in understanding the liminality between official and 
nonofficial worlds is the (social) activity of making music and musical meaning, 
which Small calis “musicking” (Small 1998, 8). We expand the concept of musicking 
to take on any number of forms, such as attending concerts, tuning in to the radio, 
practicing scales, humming, imagining music, singing along to an LP, making 
sound compilations, and bootlegging performances. The musicking we study pro- 
vides and sustains a variety of moods, commitments to nonofficial culture, modes 
of attention to music, and emotional and knowledge structures, which we refer to 
as dispositions. This congruence of dispositions via musicking is the part of the 
collective formation of a liminal space that creates “a common shared world of 
time, space, gesture, and energy, which nevertheless allows diversity and unity” 
(Pavlicevic and Ansdell 2004, 84). 

Throughout, our aim is to consider the two communist regimes in Hungary 
and Czechoslovakia in regard to control over music, youth policy, technology, and 
culture. Both “creative constriction” and “dipping into” are present in each country 
but take different forms. It is important to note that what follows is not intended to 
be a comprehensive history of sound technology and its uses in Hungary and 
Czechoslovakia or a fully comparative study as that is beyond the scope of this 
chapter. We chose to study Hungary and Czechoslovakia because the attitudes of 
their respective regime to popular music, the level of economic reform in each 
country, subsequent access to technology, and the lack of informal information 
exchange between music amateurs between these countries highlight some of the 




442 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


pathways of Creative constriction that we identify in nonofficial musical life between 
1960 and 1990. In what follows, we address two areas common to both regimes, 
nonrecordable and rerecordable sound technology. In these nonofficial cultures, 
two particular modes of production and distribution were held in common: 

Samizdat refers to self-published textual material that ranges from manuscripts 
to material that is typewritten and copied by hand, typewriter, or mimeographed. 
The existence, form, and context of samizdat, as well as the amount produced for 
dissemination, varied from country to country (Machovec 2009). Self-publications 
were created without the permission or consultation of the authorities in either 
country. 

Magnitizdat refers to the recording and distribution of sonic material that was 
not available to the public, music that was banned or censored, sound that could be 
seen as potentially subversive, or music that was wanted immediately. Magnitizdat, 
in terms of distribution in the shadow economy or black market, was not necessar- 
ily subversive but instead filled a market gap in times of shortage (Smith 1984). In 
the Eastern-bloc countries, a certain amount of bricolage, or “situated experimen- 
tation,” was employed, of the type found in settings where resources are limited and 
speedy results are required (Büscher et al. 2001). For example, some of the earliest 
methods of magnitizdat aróse from discarded X-rays in the Soviet Union in the 
1950S: The emulsión on the X-ray was a material that one could engrave as one 
would a record (Ryback 1990, 32-33). The process of production and distribution 
was known as Roentgenizdat [playing the bones], and playable at 78 rpm on a 
seven-inch record player. 


Hungary’s “Goulash Communism” and 
Czechoslovakia’s “Normalization” 


Following the de-Stalinization process in both countries during the 1950S, economic 
reforms were introduced in the 1960S, which allowed for an opening and a liberal- 
ization of the market, social rights, and relaxation of Creative constriction. János 
Kádár, who took power following the 1956 Hungarian Revolution, installed liberal- 
izing mechanisms within Hungary’s economy — which remained a socialist-planned 
economy but contained elements of a market system, such as prívate businesses and 
enterprises (Kaufman 1997, 31). This new era of mixed economic models was 
brought about primarily through Hungary’s New Economic Mechanism (NEM). 

The NEM also sought economic reform through deregulation, which, for 
example, granted more licenses to artisans and small businesses, thus reducing 
the state’s monopoly. In this atmosphere, there was added room for amateur 
musicians, and rock music was permitted to reach listeners via the mass media 
(Szemere 1983, 123). While the NEM was supported within the Politburo, the 
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Central Committee, and the General Secretariat, it was eventually halted as a result 
of hardliners’ efforts in the party, coupled with the 1968 Warsaw Pact invasión of 
Czechoslovakia, demonstrating that the economic business of the satellite States — 
even after de-Stalinization — was still of much interest to Moscow. The goals of the 
NEM were thus stalled in the ’yos but eventually returned in practice in the early 
1980S (Adair 2003). 

These new reforms were characteristics of the Kádár regime’s “Goulash 
Communism” — a form of communism particular to Hungary. It enabled the 
country to be one of the most reputable Eastern bloc States in which to live. Ultimately, 
Goulash Communism contributed to, as Adam Przeworksi (1991, 20) recalls: 

an implicit social pact in which the elites offered the prospect of material welfare 
in exchange for silence. And the tacit premise of this pact was that communism 
was no longer a model for a new future but an underdeveloped something 
else. . . As . . . Hungarian surveys showed, the outcome was a society that was 
materialistic, atomized, and cynical. It was a society in which people uttered 
formulae that they did not believe and that they did not expect anyone else to 
believe. Speech became a ritual. 

Similarly, Anna Titkow (1993, 274) describes this condition as “cognitive disso- 
nance,” which emerges in the gap between reality and the representations 
produced in the State media and official propaganda and the gap “between the 
ideáis of socialism as preached by the propaganda apparatus and as practiced by the 
System itself’ (Jakubowicz 1994). 

During Kádár’s regime the second economy began to swell in Hungary — by 
1982, more than 75 percent of the population relied on it to contribute extra income 
to their formal wage, not just supplementing it but in many cases amounting to a 
higher income in the second economy (Sampson 1987, 126). The second, or “shadow 
economy,” manifested in a variety of ways, such as peasants on farms selling their 
produce, families renting out a room in their apartment, bribing a butcher for 
choice meat, prostitution, and so on (Sampson 1987, 121). In many parts of Eastern 
Europe and the Soviet Union this second economy provided a “lubricating” func- 
tion, but this was even more so the case in Hungary, where it was an integral part of 
the planned economy and helped stem shortages and production bottlenecks 
(Sampson 1987, 122). 

Czechoslovakia, on the other hand, continued its post-1968, Moscow- approved 
path of social and economic normalization under Gustáv Husák. After consolidat- 
ing power in 197o, 4 the Husák-dominated socialist State began the first political 
process of ££ normalizing“ Czech society, which lasted until the late 1980S. As Ule 
characterized the situation in the 1970S, ££ [The] aim [of normalization was] the 
reinstitution of the status quo ante and expiration of the liberalizing heresies of the 
Prague Spring 1968. Prominent among the measures of normalization has been the 
introduction of thorough censorship” (Ule 1978, 26). Although often considered a 
return to the Stalinist practices of Czechoslovakia during the 1950S, normalization 
was characterized not by overt coerción but by extrajudicial socioeconomic 
hardship. For example, if someone’s son or daughter was caught by the pólice 


444 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


distributing banned material, that person might be prohibited by the State from 
attending a university (Kreidl 2004) — often such nonofficial practices endangered 
networks of connections between friends and neighbors. 

Along with these economic and political changes in both countries carne adjust- 
ments to cultural policy. In Czechoslovakia, the reconfiguration of the rock music 
scene was achieved in part by keeping official rock bands in line with the minister 
of culture’s new policies, primarily by regulating hair length, limiting musical 
genres, instigating a “no English” policy, and editing lyrics (Ryback 1990, 143) — 
a dramatic turn from the thriving bigbít music scene in the latter part of the 1960S. 
At the center of this Creative constriction were required “requalification exams” for 
musicians. These were taken every two years in order to obtain a license to play 
professionally or even as an amateur. As state-run institutions, the licensing agen- 
cies functioned as a “censorship mechanism.” They had the authority to determine 
which musicians were allowed to perform based not only on exams that tested their 
knowledge of musical theory but also on their familiarity with Marxism-Leninism, 
their presentation, the lyrical content of the music, and the length of their hair 
(Vanicek 1997, 33-37). Saxophonist and guitarist Mikolás Chadima describes seven 
points that musicians had to abide by if they wanted to pass the exam: 

First, no English band ñames! Second, no long hair! Third, no English texts! 

Fourth, be properly dressed! Fifth, don’t play music which is “too wild”! Sixth, 

learn the rudiments of music theory! Seventh, don’t argüe with the adjudicators 

and let them Ínflate their ego at your expense! (Vanicek 1997, 47) 

The agency tested these seven points in musical auditions, oral tests of political 
theory, and finally a written test of Western music theory (Vanicek 1997, 47-50). 
While exams to determine a musician’s ability to play music still existed in the 
1960S, the new requalification exams also established a musician’s place in “normal- 
ized” Czech society, in that one could not pass the exam without an adequate 
knowledge of, for example, “the history of the worker’s party . . . who the Minister 
of Culture was . . . or their opinions on communism” (Vanicek 1997, 49). Efforts 
made by the Ffusák government from 1970 to 1973 to curb rock’s growing interest 
among the youth population culminated in the implementation of the exams in 
1973, 5 which segregated musicians as “official” or “nonofficial” in Czechoslovakia 
for the remainder of the communist era. 

In Hungary the State institution for the National Management of Light Music 
also instigated practice licenses, defined performance fees, and employed musical 
proficiency tests (Szemere 1983, 131). Ffowever, responses to rock music were based 
on strategies of a modified form of commercial inclusión rather than on a división 
between official and nonofficial musicians or explicit repression of the latter. At its 
most visible, this inclusión took the form of giving bands recording contracts but 
not allowing them to record some of their most popular songs. Moreover, the 
release of an álbum could be delayed for years, long after the popularity of its pieces 
had waned. 6 The second economy, however, took care of this bottleneck through 
bootlegged, copied, and exchanged audio cassette tapes. 
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Regardless of the Hungarian regime’s restrictions, the situation was still entic- 
ing to some since being a rock musician in Hungary during the ’yos and ’8os 
permitted musicians to receive royalties and tour extensively not only throughout 
the Soviet bloc and the Soviet Union itself but also in other parts of Europe and 
the United States if sponsored by the regime. In short, Hungarian musicians still 
had a degree of freedom relative to their Czech counterparts. The containment was 
less strictly enforced, and officially there was no censor. 7 As János Kobor of the 
official rock star group Omega attested in 1987, “There is no strong censor, but they 
are, you know, careful. When we made our last álbum, the record company asked 
us about the title. They said, this c Dark Side of Earth,’ does it refer to Hungary or 
does it refer to the socialist world? And then after we said no, then it’s all right, no 
problem.” 8 

The Hungarian form of control by inclusión also involved a method of tactical 
overexposure of musicians, which diluted their potential threat. One of Hungary’ s 
first punk groups, Beatrice, was subsumed into the official music scene after a series 
of meetings with representatives of the official culture. This led to a TV spot to dis- 
cuss punk music, competing in a song contest sponsored by Hungarian radio, and 
being offered a support-band role in a tour with the superstar groups Omega and 
Locomotive GT. Seen by fans as “selling out” and leaving their subversive message 
behind by playing with the establishment bands of Hungary, Beatrice was not helped 
when the State released a live concert álbum of the bands (Ryback 1990, 173-74). For 
the government it was doubly effective: overexposing a subversive band while also 
improving record purchases by the youth, illustrating the regime’s inclination to 
institutionalize commercial success. 

Within Goulash Communism, Hungarian official cultural life had a loose 
categorization of bands called “the three Ts” (türt, támogatott , tiltott [promote, 
permit, prohibit]): Certain bands, such as Omega, were promoted, while others 
were permitted to perform but not to record, and some were not allowed to play 
under any circumstances. 

For both regimes, a key problem was not only how to address restrictions 
on musical practices but also how to assert policy and control in relation to 
sonic and technological areas. How, for example, were they to protect radio 
frequencies from unwanted broadcasts, and should they monitor and assess the 
ideological content of cultural producís? These discussions over policy ebbed 
from the 1960S to the 1980S; however, restrictions remained in place in both coun- 
tries until 1989. Moreover, technological innovation — particularly gadgets, sound, 
and video — themselves raised ideological questions across the bloc. Since such 
devices were invented and produced in the West, how could a regime import 
such objects without destabilizing the foundations of the communist system 
(Kusin 1987)? Each regime met these issues in different ways, which in turn affected 
access to the latest technological and cultural products such as albums, films, 
literature, and clothes. The regimes thus became active participants in how music 
was experienced by their direct involvement with and control over sonic-related 
matters. 
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Nonrecordable Sound Technology: 

Radio and LP Listening 


Listening to the radio had always been legal in both Hungary and Czechoslovakia. 
In fact, it was often either encouraged or compulsory in situations such as collective 
listening to State propaganda in the workplace or barracks or the more passive lis- 
tening found in public areas that had loudspeakers (Rév 2004). However, certain 
radio stations — Radio Free Europe (RFE), Voice of America, BBC — were banned, 
and listening to such prohibited radio contení could lead to punitive measures. 
Nonetheless, radio, which was ubiquitous, inexpensive, and manufactured in the bloc, 
provided a readily available entrée into nonofficial musical experiences and did not 
require the listener to acquire any illegal or prohibited technology; to dip into this 
nonofficial practice, an individual only had to tune in at certain times to specific 
radio frequencies. Moments of “dipping in” were made possible simply by listening. 

While this mode of listening could have been for either informative, news- 
related purposes or for musical enjoyment (often the two were wrapped up into 
one program), this activity was risky and possibly dangerous. For example, if listen- 
ing in a block of apartments, neighbors could overhear the unusually unsually loud, 
hourly station update: “This is Radio Free Europe on the íóth, i9th, 25th, 3ist, 4ist, 
and 49th shortwave bands” (Rév 2004, 5). 9 In this sense, the radio sound — both 
volume and contení — afforded peripheral modes of listening: overhearing and 
eavesdropping. Not to be confused with “bugging” or State surveillance methods, 
overhearing and eavesdropping emerged from the sociopolitical circumstances of 
living situations such as the overcrowding in urban areas and simply the disposi- 
tions of curious people. These practices were part of what can be described as the 
necessary “skill set” in a second economy and also included skills Grossman calis the 
“4 Bs of resource procurement, bribery y bartering , black marketeering , and ‘blat, ” 
a Russian term for “connections or influence” (Grossman, quoted in Sampson 
1987, 128). 

While we do not mean to assert that individuáis were inclined to eavesdrop in 
these situations for malicious purposes, in some contexts information about the 
tenant living upstairs in a larger apartment with a balcony could be used advanta- 
geously if one wanted. On the other hand, a friendly neighbor who overheard 
someone listening to Western broadcasts was in the precarious situation of possess- 
ing unwanted and potentially harmful information. Underpinned by sound, these 
modes of listening in prívate spaces — overhearing and eavesdropping — heightened 
people’s fear and at times placed them in danger. 

Moreover, while listening to the radio in prívate, one was also subject to the 
noise of radio jamming from centralized locations in Budapest. Listeners in 
Czechoslovakia referred to Soviet jamming as “Stalin’s bagpipes.” As Rév (2004) 
argües, the noise let the listeners know that even in the confines of their own homes, 
the regime was still able to censor, control — and at the very least — be present in the 
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lives of citizens (Bijsterveld 2008). Fear and anxiety could to some extent be allevi- 
ated by well-timed listening. It was possible to listen during the early hours of the 
day or late at night and thus avoid the jamming. Quoting K. R. M. Short (1986, 6), 
Rév (2004, 25) illustrates the temporal strategy of listening: 

The timing of the broadcasts is also important because the twilight hours of 
morning and evening are the most ineffective of Soviet-originated sky-wave 
jamming. This is because the western broadcasts can take advantage of the 
ionosphere’s “solid” condition at these times, while the eastern jamming 
broadcasts have difficulty in achieving a reasonable reflection in their “broken” 
section of the ionosphere. This creates a time-related gap in the Soviet defenses. 

The strategic listeners were thus able temporally to configure their sonic space 
and practices in order to hear more clearly and to avoid the omnipresent regime’s 
jamming noise. 

One very popular RFE program broadcast into Hungarian-speaking regions 
was “Teenager Party,” which played Western rock to listeners and accepted post- 
cards and letters requesting songs from the Beatles, Jimi Hendrix, the Bee Gees, 
Paul Simón, and other popular Anglo-Saxon bands. 

These radio programs expressly provided an opportunity for informal musical 
learning through imitation. Nagy Feró, leader of the Hungarian punk group 
Beatrice, stated that budding musicians turned on the radio and learned music by 
“listenfing] to ‘Teenager Party’ and copying the songs phonetically.” 10 In this way, 
new informal practices were acquired through musical experience — in this case, 
imitation and modeling. Similarly, Canadian musician Paul Wilson, who lived and 
performed in the Czech Underground during the late ’6os and ’/os, describes radio 
listening in Czechoslovakia during the 1960S: 

One of the things that censorship [before the Prague Spring] did very badly was 
keep music out of the country. One of the things that was very marked in the 1960S 
was that although intellectuals found it very hard to get a hold of books it was very 
easy for kids to be right on top of things because records were brought in and the 
music was broadcast over Voice of America and other radio stations. So, there was 
a very current music scene [in Prague], with a lot of knock-off bands and a lot of 
fans of different groups just the way you’d find them in the West. (Velinger 2005) 

Censorship efforts by both regimes to restrict radio listening had unintended results: 
They fostered a culture of exposure to new music. Each week new sounds carne over 
the ether, and musicians developed listening strategies that in turn provided infor- 
mal training. They imitated the music they heard and formed ÍC knock-off bands.” 
Thus, music was passed from one sound technology to another and from person to 
person through various practices of distribution and musical “information.” These 
practices ultimately had implications for what actors could do and for how musical 
experience carne to be linked in collective efforts. 

Because listening privately was a common practice, it is clear that there were 
many actors participating collectively yet separately in listening; in other words, 
while the sonic experience was happening a in the room,” it may not have fostered 
microsocial interaction but rather a felt, collective experience. Radio broadcasts 
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from abroad, as well as this collective experience and interest, led both Czechoslovakia 
and Hungary to produce rock radio programming in the 1960S; the regime itself 
imitated Western forms of cultural technology. This start-up of rock radio pro- 
gramming as a result of popular demand reflected the growing “confusión” pro- 
duced in these communist regimes by rock music and its growing significance for 
young people (Ryback 1990, 88). While the State frequencies expanded rock airplay 
in the ’6os, it was inevitably halted in Czechoslovakia after the Soviet-led, five-army 
Warsaw pact invasión in 1968 — radio programming in Czechoslovakia returned to 
brass-band broadcasts shortly there after. 


Musical “Cues” for the Room 


In addition to the collective-yet-separate radio listening in apartments and houses, 
people could move easily between collective listening practices afforded by LPs or 
recordable technology such as open-reel tape recorders, which were not transmit- 
ted through the ether. One member of the Czech Underground (Frantisek Stárek) 
described his radio/LP-listening practices in the late 1970S as follows: 11 

FS: While living on the commune, we used to finish meáis together and then put 
on V oice o f America. 

TH: Where did you listen to the program? 

FS: In the kitchen. It was the only room big enough for all of us to fit. We had 
about ten to twelve people during the week and many more [Undergrounders] 
at the weekend. 

TH: How did you listen? 

FS: Well, everybody was silent. Even the kids who normally ran around knew that 
they should be quiet at this time. We wanted to hear about what was going on 
in our country. 

TH: And did you listen to music programs as well? 

FS: Not really. Afterjward] we’d listen to Underground [bootlegs] and drink. 

The listening experience in this Underground commune illustrates the more 
general, fluid, and múltiple listening tendencies common at the time. As listeners 
moved from one sound technology to another — from radio to open-reel bootleg 
concert recordings — the listening practices changed slightly. Listening together 
in a room that afforded microsocial interaction was not always possible. In such 
cases, technology created sound for the room: playback for prívate gatherings and 
parties in apartments, bedrooms, and weekend homes, where music provided both 
the backdrop for, and cues to, social interaction. Listeners, technology, and the 
sociopolitical context repeatedly coproduced the culture of the space — illegal, 
alternative, and filled with anxiety and excitement. 

While the radio offered variation and the possibility of experiencing new music, 
this new pattern of listening behavior contrasted with LP culture in Czechoslovakia, 
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where large collections were rare, and the same albums were often played repeat- 
edly in domestic environments. The selection of Western LPs was quite limited in 
Czechoslovakia, and listeners who wanted to acquire LPs turned to the weekly black 
markets in Prague or asked relatives and friends abroad to send albums. 

The Hungarian Record Company (HRC) had quality record-pressing technol- 
ogy in the 1960S and 1970S and doubled its output of LPs from 1975 to 1980 in an 
attempt to meet the needs of younger audiences (Szemere 1983, 130). This trend 
toward pop music stemmed in part from Hungary’s policy of moving away 
from isolationism or Sovietization in the 1950S to a “windows to the West” policy 
in the 1970S. As Cultural Secretary Aczel addressed delegates of the Society for 
Dissemination of Scientific Knowledge in 1968: 12 

There are people who demand the closing of our borders to cultural and 
intellectual exchange; they want to isolate our culture. It would be impossible to 
implement such a policy, but even if we were to succeed in doing so, we should 
create a situation, which would leave us ignorant and weak; a robust plant cannot 
be grown in a hothouse. 

The culture secretary’s stated position of “peaceful coexistence” with the West 
helped to widen the gap of rhetoric and practice in Hungary: Although there was an 
abundance of Hungarian LPs, foreign albums were still highly restricted, and one 
had to rely on methods similar to those in Czechoslovakia: obtaining LPs from 
relatives in the West, black markets, or by chance finding a secondhand Yugoslav- 
pressed álbum in a store. 

Modes of listening are thus tied up with subsequent access to technology and 
musical recordings — how one would procure an object and where it was listened to 
all contributed to the shaping of the musical experience. According to Ruth 
Finnegan (2003, 183), listening as musical experience involves learning “how to feel, 
how to deploy particular emotions in contexts appropriate to our situation.” 
Moreover, if emotional structures can be the seeds of knowledge production 
(Witkin and DeNora 1997, 5), it is possible to see musical experience molding a flex- 
ible, liminal space — a space where one can view and imagine both the nonofficial 
and the official lifeworlds (Eyerman 2006). Musical experience is thus at once lived, 
embodied, and “intertwined with culturally diverse epistemologies” (Finnegan 
2003, 183) from a range of cultural resources mediated by its interaction with sound 
technology. 


Reconfiguration and Repackaging of 

Music and the Actor 


The listening pattern produced by playing LP collections as opposed to listening to 
diversified radio programming provided a further entrance point into the liminal 
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space by learning how to hear music: Discussing the music in the physical space of 
listening was one way in which people were able to reconfigure themselves in an 
alternative manner. 

In the late 1960S in Czechoslovakia, a handful of musicians who rejected the 
newly instigated official music standardization moved to private gatherings to listen 
to music, talk about music, and share the small collections of music they possessed. 
Here, in these private gatherings while listening to LPs, they would also experience 
running monologues and informal lectures on aspects of Czech cultural history, 
orated primarily by Ivan Jirous, art historian and band manager of the Czech 
Underground group, The Plástic People of the Universe. 

The lectures discussed and explained current Czech rock music within the con- 
text of Czech musical reviváis, art history, and political history and maintained that 
a even in the darkest of times, the Czechs had always been able to keep the fíame of 
culture alive” (Jirous 2006 [1975], 7). The notion of a nonofficial lifeworld here, 
understood not in relation to “the Other” or based on difference, was rather the 
opposite: that rock music was not just a new phenomenon; instead, it was part of an 
experiential mode of culture that had long existed in the Czech lands — an alterna- 
tive mode of being present throughout the country’s history, to which the Czech 
Underground belonged. 

These lectures and ritual listening sessions, consisting of copious amounts 
of beer and dumplings, were described by Paul Wilson during some of his first 
meetings with the Czech Underground in an apartment in central Prague: 

[Jirous] would put on his favorite records on a battered turntable jacked into an 
oíd WWII radio ... I lay back and listened to the Velvet Underground, Captain 
Beefheart, the Doors and the Fugs, and as I listened, I began to feel a depth in the 
music I hadn’t felt before, as though I were hearing it for the first time with Czech 
ears. (Wilson 2006 [1983], 20) 

Here we begin to see how these sounds carne to be empowered as exemplary of 
Underground life: They emerge from the intersection of sounds, sound technolo- 
gies, and social rituals of listening to help construct a lifeworld; the musical experi- 
ence clearly afforded Wilson a different form of comprehension or depth of 
listening. These rituals helped form nonofficial dispositions that carne to structure 
the flexible médium of the Underground — a cultural space and mode of living that 
were particular to this community in Czechoslovakia. In this case, the musical 
sound was a flexible object that carne to be symbolic of an alternative way of life as 
it was combined with other practices of specifically Czech consumption (beer, 
dumplings, a Czech living space). 

To speak of the music’s flexible affordances is to highlight how the physical 
space for listening provided, in part, contextualization cues. DeNora (1986, 91) 
describes these contextualization cues as “various conventions or ritual practices 
that, through experience, come to carry certain connotations which serve as the 
tools for the work of sense-making and meaning construction.” In other words, 
cues help listeners to shape their interactions and appropriations of musical objects 
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and, in the case of Czech appropriation, as they immerse themselves, to varying 
degrees, in transformation. In the context of Czech listening rituals, the extramusical 
contextualization accomplished by “running monologues and informal lectures” 
achieved two goals: a reconfiguration of the musical contení and a reconfiguration 
of the actor’s mode of listening and attention from “non-Czech” to “Czech.” For 
Wilson, reconfigured as someone with “Czech ears,” the music then became a 
soundtrack for the placement of alternative modes of being within Czech history. 

Reconfiguring the space through active listening during radio and LP playback 
and collective listening connected actors to networks of feeling, being, and thinking 
and thus enabled actors to distance themselves from official society and to dip 
more than a toe into nonofficial culture. How far individuáis dipped in and how 
long they stayed immersed was, of course, dependent on a web of other social and 
familial ties. 


Recordable Sound Technology: 
Cassette Tapes and Open Reel 


The dispositions formed through radio and LP listening served to develop practices 
related and translated to other sound technologies, such as the rerecordable tech- 
nology of open-reel and audio cassette tapes. These sets of practices surrounding 
the exchange of bootlegged concerts, compilations, and LP copies became exten- 
sions of older sound technologies such as radio. These practices allowed for modes 
of being to shift in accordance with technological change: Cassette tape technology 
permitted exchange to develop into a mode of communication in its own right. 
Simultaneously, the new technology and the tape trading that it facilitated during 
the late 'yos and ’8os in turn enabled a wider culture of alternative sound as listeners 
forged new (and highly nonofficial) practices of musical exchange and recording. 
Tapes could now be acquired by ordering through samizdat magazines by contact- 
ing tape traders through postings in public places, by recording concerts, and by 
exchanging with friends and acquaintances. As tapes carne to be associated with the 
practice of tape archiving/collecting, the alternative articulations between music 
and collective action burgeoned. The once collective-yet-separate radio listeners 
were thus linked via active engagement and practices to one-to-one or many-to- 
many interactions involved in exchange, thus generating still more network links 
among the actors. 

This burgeoning informal exchange system — within which distribution became 
a cultural end in itself — in turn augmented the contextualization cues previously 
associated with radio and LP listening. Overall, these processes removed music 
from its original associations (for instance, how it was framed by artists) and from 
the conventional notion of creation/composition as a distinct phase, still further up 
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the chain in arts production (Becker and Pessin 2006). Instead, tape distribution 
recontextualized musical works by delivering them to consumer groups that their 
authors were not originally intending to reach. Thus, the music distributed via tape 
in Czechoslovakia and Hungary during this time derived its meaning and social 
power from its repackaging and its novel and adaptive forms of distribution. 
Cassette tapes, as well as the music they contained, heightened and expanded one’s 
personal network: Oftentimes someone owned a tape that had been received 
through distant connections and copied several times; the audio quality of cassettes 
copied many times over left a sonically anonymous trail of the network. 

In other words, the music was recontextualized by its modes of distribution, 
which brought into being new systems of collective representations as a result of 
tape exchange. The sound technology of cassette tapes and open-reel tape recording 
thus extended the simple act of listening (e.g., to a radio broadcast or to LPs 
exchanged on the black market) to acts of recording, compiling, and bootlegging. 
New associations, images, and ideas could, in other words, be hung on tones 
(DeNora 1986, 93), and here the associated practice of tinkering and the participa- 
tory design feature of tape trading/bootlegging come to the fore. Members 
of the Hungarian punk band CPG were one such group that was persecuted for 
its participatory design of a compilation tape of music and RFE commentary. 
The trial of the “punks of Pol Pot County” took place in 1982 in the Hungarian 
county/town of Szeged. The members of CPG were first charged with incitement 
for the cassette tape they were carrying. According to Szonyei (2005, 88), sand- 
wiched in between songs on the tape was political commentary on the martial 
law in Poland, recorded from RFE. Although the musicians were tried and acquit- 
ted for disturbance of the community, the pólice were unsettled not only by 
CPG and its actions but also by the possibilities of playback in the room and dis- 
semination to a wider audience. The broadcasts, torn from their RFE dispositif \ were 
crafted and adjusted to fit CPG’s own dispositions. In other words, cassette tapes, 
combined with radio, offered a type of radizdat that yielded an assertion of user 
control over the sonic environment. 13 

Various practices associated with exchange were in part facilitated by the 
regimes in each country. In Hungary in 1981, economic reforms that took the NEM 
into consideraron were finally starting to gain momentum. The changes taking 
place allowed for small businesses, such as record stores, to appear and function 
within Hungary’s liberalized/ing market. For instance, from 1970 to 1983 the number 
of private shops in Hungary doubled — to 19,293 (26 percent of all shops) (Sampson 
1987, 125). 

In part because Hungary had a more open market since the mid-i9óos, the 
practice of taping was somewhat different from that in Czechoslovakia. Artist 
Gyorgy Galántai was an avid taper who used a Sony tape recorder purchased 
through relatives in Vienna. Galántai, the founder of the alternative art archive 
Artpool recorded concerts for posterity and exchange abroad while receiving demo 
tapes from Hungarian bands. 14 In the early ’8os, Galántai began to produce a series 
of cassette tapes to exchange as mail art, which were described as “cassette-radio, 
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radio work” called “Radio Artpool.” The cassettes were a bricolage of interviews, 
music, ambient recordings, documentation of telecommunications concerts, 
improvisations, sound art, found sounds, spoken words, and so on. The eight-part 
series merged sound, posterity, distribution, and exchange into one object as an 
audio versión of a cultural journal, thus indicating a mode of listening attention 
that was active, critical, and expressly disseminated via the postal system. 

Sound technology and distribution in Czechoslovakia took on a different role. 
With a stricter regime line on popular music, there was less foreign music on the 
official market as there was in Hungary. Officially, there were few places to buy 
Western LPs, although a Czech label did release some American jazz imprints, and 
select albums could be purchased or ordered through the record clubs, Gramofonovy 
Klub and HiFi Klub (Vanícek 1997, 121). Moreover, cultural centers in other coun- 
tries served as places to pick up foreign music. At the Hungarian Cultural Center in 
Prague, for example, it was possible to listen to or lend albums by Omega and other 
Hungarian rock giants. 

In this rather large gap between official and nonofficial musical acquisition, 
Czech magnitizdat labels and distributors carne into existence. One such label, 
S.C.T.V (samizdat cassette tapes [and] videos) was primarily run by Petr Cibulka 
from his apartment in the Moravian capital of Brno, where his mother took care of 
the administrative side of the label (Vanícek 1997). While many people in the coun- 
try were still using open-reel recordings, Cibulka in 1976 was credited with produc- 
ing one of the first samizdat compilation tapes, which comprised many bands of 
the Czech Underground, who at that time had been imprisoned and set the stage 
for the launch of Charta 77. 15 Similar to the ways that Hungarian tapes were 
constituted, this first Czech compilation tape instigated the use of manipulatable, 
rerecordable technology as participant design and signaled the formidable quality 
that cassette tapes afforded as a do-it-yourself (DIY) set of practices. 

A similar independent initiative was started by Mikolás Chadima, who ran a 
small label, Fist Records, from his apartment. 16 Fist Records’ approach differed 
slightly from that of S.C.T.V. in that Cibulka held that magnitizdat and samizdat 
should be spread widely and in quantity, echoing dissemination as an alternative 
mode of being. Chadima similarly used distribution as a Creative mode of being; 
however, he focused on releases that met his idea of quality music and created cover 
art and liner notes for many of his releases (Vanícek 1 997). Although a majority of 
the taping and copying was done by these two individuáis, the trading and exchange 
operations were carried out between friends, siblings, and acquaintances and thus 
involved a collective of people dipping into and reconfiguring a musical experience 
linked not only to listening but also to magnitizdat creation and distribution. 
With tape exchange, the informal learning shifted the contextualization cues from 
the physical space of the room, as described earlier, to the social ritual of distribu- 
tion. Actors, in turn, became reconfigured by “dipping in” while simultaneously 
reconfiguring the sound technology for their own use and purposes. 

For example, Chadima’s Fist Records often did not have sufficient personnel to 
complete all of the liner notes and every piece of cover art; thus, graphic designers 
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or individuáis who could help copy the art and liner notes would offer their time 
to Chadima, thus “dipping into” alternative practices a little bit more as tape 
distribution opened up new avenues. 

Liner notes could sometimes be found in samizdat magazines such as Vokno, 
which was printed and distributed by the aforementioned Underground commune 
in Czechoslovakia. Translated roughly as “memory loss” and also a play on the 
word “window,” Vokno’s samizdat distribution web spread across the republic 
from 1979 to 1989 and provided, as a part of the tape exchange, the contextualiza- 
tion cues for listening that the physical space afforded. In addition to liner notes, 
Vokno often contained artwork and details of recordings, not to mention order 
forms and inventory for S.C.T.V and Fist Records. Those who read samizdat or 
listened to magnitizdat were thus drawn further into a set of consumption practices 
that linked listening, technology, and contextual cues provided by magazines such 
as Vokno and the practice of exchange. 

If the Creative constriction of performing, recording, and distributing music in 
each country helped to set up exchange and taping as novel practices, they also led 
to novel forms of musically mediated learning. More specifically, they paved the 
way for the individual and collective learning of new dispositions associated 
with how to live/be nonofficially. Indeed, they provided object lessons in how to 
communicate nonofficially. 


Conclusión: The Liminality of 
Nonofficial Musical Experience 


We have described how sets of nonofficial musicking practices took shape in rela- 
tion to and in turn shaped sound technologies. 17 This mutual structuring process 
also gave rise to situations within which individuáis could test alternative disposi- 
tions and underground culture. Within these situations, the musical object was 
simultaneously an aesthetic item and its distributive technologies. Opportunities 
for musical experience in Czechoslovakia and Hungary provided a liminal space, 
one poised between official society and nonofficial life, affording fine degrees of 
commitment and deliberation. Because of the heterogeneity of this space, which 
was neither official ñor nonofficial, it was possible to gradually move into deeper, 
nonofficial waters and to learn about and adopt new, alternative dispositions in 
informal and not necessarily conscious ways. If we were to theorize about music’s 
role in relation to collective action and collective consciousness (and so to 
build upon earlier work on this theme in ways that seek to delinéate the actual 
mechanisms through which music provides “exemplars” for action and collective 
mobilization [Eyerman and Jamison 1998]), we would suggest that the musical 
experience, importantly, provides a two-way flow between individual and collective 
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world building: (i) an aggregating conduit to the development of dispositions 
(learning how to) and (2) a collective distillation of dispositions (from many 
experiences to one collective experience). This definition of musical experience as 
liminal space enables individuáis to take either the first steps into nonofficial life 
or to reaffirm and constitute their alternative disposition through control — or 
choice — over the sonic environment. In what we have described, the musical expe- 
rience has two parts: a sensory aspect (as in listening) and an active facet (as in 
participating in distribution as creation). 

The social changes in these nations in the late 1980S — breakdowns in different 
parts of government and dissident political initiatives by opposition groups — 
rested, we suggest, on a technologically dispersed, linked population of actors who 
had, at one point or another and to varying degrees, dipped into and informally 
acquired alternative modes of being. This “dipping in” (from toe to fully immersed) 
was, we suggest, a critical resource for the reconfiguration of dispositions and the 
increasing rejection of the Creative constriction imposed by official institutions. 


NOTES 


1 The research involved ethnographic interviews of more than twenty individuáis in 
both the Czech Republic and Hungary conducted 2007 and 2009. These interviews focused 
on musical experience related to sound technology. Interviewees were people involved in 
the practice of making and distributing samizdat and magnitizdat (see later explanation), 
as well as consumers of the material from the late 1960S to 1989. Additionally, archival 
resources were gathered from the Open Society Archives and Artpool in Budapest, 

Libri Prohibiti, Institute for Contemporary History (Üstav pro soudobé déjiny), and the 
Archive of Security Forces (Archiv bezpecnostních slozek) in Prague.Grounded theory 
methods were employed for analysis of the interviews and archival data (Glaser and 
Strauss 1967). 

2 Throughout, the term nonofficial is used not only to describe underground or 
alternative movements in each country but also to discuss the actions of any individual (as 
opposed to an organization or an office) — from a dissident to a merchant to a teenager. 

3 The “West” is described by one Czech listener as primarily the UK and the United 
States. 

4 Husák was both the president of the country and secretary general of the 
Communist Party. 

5 For example, Supraphon, the national recording label, halted or canceled all music 
projects that contained what were considered to be too extreme Anglo-American themes. 
Instead, the company played and recorded brass band music. 

6 Interview with Tamás Szónyei by Trever Flagen, Budapest, Flungary, July 3, 2009. 

7 Ibid. 

8 Charles T. Powers, “Changing Times for Hungary’ s Pop Music Giant Omega,” Los 
Angeles Times (Sept. 5, 1987), 1. Retrieved in PIU-300-40-2 Records of Radio Free Europe/ 
Radio Liberty Research Institute, Hungarian Unit, Subject files. 

9 As Rév (2004) notes, exiles from Hungary, who were surveyed by Radio Free 
Europe, consistently remarked on the volume level of the hourly updates in comparison to 
the volume level of the programs. 
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10 Agnes Sesztak, “ ‘Punk’ Star Interviewed, Describes Triáis, Tribulations,” Mozgo 
Vilag 12(86-91) (1984), 48. Retrieved in HU-300-40-2 Records of Radio Free Europe/Radio 
Liberty Research Institute, Hungarian Unit, Subject files. 

11 Interview with Frantisek Stárek by Trever Hagen, Prague, Czech Republic, Apr. 21, 
2009. 

12 Radio Free Europe research report, “Hungary’ s Windows to the West,” Oct. 28, 
1970; http://www.osaarchivum.0rg/files/holdings/300/8/3/text/34-6-85.shtml. 

13 Radizdat means dissemination by radio, transmitting banned sonic material back to 
the country of its production. It also refers to the transmission of Western music and news 
into the bloc. 

14 Interview with Julia Klaniczay by Trever Flagen, Budapest, Hungary, July 1, 

2009. 

15 Interview with Milos Mueller by Trever Hagen, Prague, Czech Republic, February 3, 
2007, and interview with Frantisek Stárek by Trever Hagen, Prague, Czech Republic, 

Apr. 21, 2009. 

16 Interview with Mikolás Chadima by Trever Hagen, Prague, Czech Republic, 

Jan. 28, 2009. 

17 The musical experience we have described has covered tuning in to the radio and 
listening to vinyl or illegally recorded material; it should be noted that jam sessions, 
rehearsals, and live performances would also need to be included in a broader definition 
but are not discussed here. 
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CHAPTER 19 


THE AMATEUR IN 
THE AGE OF 
MECHANICAL 
MUSIC 


MARK KATZ 


Introduction 


In 1906 the American composer and bandleader John Philip Sousa famously 
predicted that the rise of mechanical music — in particular, the player piano and 
the phonograph — would mean the end of the amateur musician. “[I]n this the 
twentieth century,” he wrote, “come these talking and playing machines, and 
offer . . . to reduce the expression of music to a mathematical System of mega- 

phones, wheels, cogs, disks, cylinders, and all manner of revolving things It will 

be simply a question of time,” he concluded, “when the amateur disappears entirely” 
(Sousa 1906, 279, 280). Sousa was the most prominent and influential figure to 
express concern about the deleterious effect of technology on musical performance, 
but he was neither the first ñor the last. As early as 1878, when Thomas Edison’s 
phonograph was barely a year oíd, a New York journalist suggested that “with the 
perfecting of the phonograph to the degree Edison’s industry and skill give prom- 
ise, the occupation of the skilled performer will be gone” (quoted in Feaster 2007, 
136). Unlike Sousa, this writer eulogized the professional musician, and in subse- 
quent years opponents of mechanical music sometimes foretold the death of one, 
the other, or both. Regardless, the predictions were uniformly dire as, for example, 
music historian Leonid Sabaneev’s 1928 prophecy that “Orchestras will perish and 
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disappear, like the ichthyosauri” (Sabaneev 1928, 114). Some even offered specific 
expiration dates: American critic Winthrop Parkhurst believed it would be the year 
2000, while British librarían Lionel McColvin saw it happening no later than 2031 
(Parkhurst 1931, 6; McColvin 1931, 322). 

Amateur music making was seen as particularly vulnerable, and it was said that 
few would want to expend the effort to make music for themselves when high- 
quality performances could be enjoyed effortlessly at home. Despite the warnings, 
the amateur musician never went the way of the ichthyosaur. To the contrary, from 
Sousa’s day to the present, amateur music making not only survived but also 
thrived. Moreover, musical amateurism persisted, not despite the presence of 
the player piano, phonograph, and their descendents; in good part it flourished 
in response to the possibilities of these technologies. 

This chapter explores the role of amateur music in the age of sound recording 
and reproduction technologies. It begins by returning to Sousa’s complaint in 
order to evalúate concerns about the fate of the amateur in the early twentieth 
century. Four brief case studies then examine the complex relationship between 
amateurism and music technologies. The first remains in early twentieth-century 
America to consider the unexpected uses of the player piano and the phonograph. 
The next two case studies focus on musical phenomena that aróse in the 1970S 
but remain hugely influential decades later — hip-hop and karaoke. Situated in 
the early twenty-first century, the final case study examines how digital music tech- 
nologies — in the form of video games and mobile phone applications — challenge 
traditional notions of musicianship and amateurism. Most of the examples I cite 
come from the United States, and my claims hold most strongly for American 
musical life; I do, however, draw evidence from Europe and Asia as well in order to 
suggest the global scope of technologically mediated amateur music making. 

In this chapter I take a strongly user-centered perspective. Although I grant 
that both design and marketing influence human engagement with technology, the 
case studies reveal a constant process of co-construction between users and tech- 
nologies and deny any kind of determinism strong enough to predict with great 
specificity how a given technology may be deployed. In fact, my own refrain will be 
that users have exerted, much more than is typically acknowledged, a decisive influ- 
ence on the development of sound technologies since the late nineteenth century. 
In taking this stance I am sympathetic with SCOT (the social construction of 
technology), given its focus on technology in its social contexts, as well as on the 
fields of cultural studies and media studies, which explore how users perform and 
construct identity through technology (e.g., Oudshoorn and Pinch 2003). 


The Menace of Mechanical Music? 


In the early twentieth century amateur music making existed in many forms. 
Amateurs often performed publicly, whether in municipal brass bands, choral 
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societies, or community orchestras and opera troupes; they could be heard in 
schools, at sporting events, in parks, on the streets, and in houses of worship. In 
prívate, amateurs gave home recitáis for neighbors, gathered around the parlor 
piano to sing popular songs, or played chamber music. 

The extent of early twentieth-century amateur activity is hard to quantify with 
any precisión, but it is possible to gauge its growth or decline indirectly by tracking 
the numbers of music teachers, which, at least in the United States, formed a census 
category. Music teachers are indeed professionals, but their clientele consists pri- 
marily of amateurs — most often children but also adults seeking to develop their 
musical skills. (Professional musicians tend not to take regular instruction.) 
Therefore, a rise or decline in the number of music teachers suggests a correspond- 
ing change in the number of amateur musicians. We can thus evalúate Sousa’s claim 
that, because of player pianos and phonographs, “it will be simply a question of 
time when the amateur disappears entirely, and with him a host of vocal and instru- 
mental teachers, who will be without field or calling” (Sousa 1906, 280). Contrary to 
Sousa’s assertion, the years immediately following the publication of his article saw 
an increase in the number of music teachers in the United States. If we take the 
period between 1890 and 1910 — one that included the introduction of commercial 
records and the launch of the extremely popular phonograph model, the Victrola — 
we see a significant increase in the number of music teachers and professional musi- 
cians. (Unfortunately for the present purposes, these two groups were counted as 
one category.) As table 19.1 (adapted from Harris 1915, 301) shows, modérate growth 
occurred between 1890 and 1900, but then a considerable jump took place between 
1900 and 1910, the period in which Sousa was writing. In twenty years the number 
of musicians and music teachers per capita increased by 50 percent. Clearly, neither 
the professional musician ñor the amateur suffered during that time. The same was 
true of later decades as well. A 1967 report claimed that the total number of amateur 
musicians in the United States grew by 141 percent between 1930 and 1960 (from 
13,000,000 to 31,300,000), far outstripping the 46 percent population increase 
during the period (from 122,773,046 to 179,323,175) (Report on Amateur Instrumental 
Music 1967, 4). 

What, then, do we make of Sousa’s clearly wrongheaded claim? Rather than 
simply dismiss him, we should consider what motivated him and like-minded crit- 
ics. Sousa may have cared about amateur musicians for many reasons; one of them, 
however, was that he depended on them for a good deal of his income. Every time 
Sousa wrote a new composition, he published it not only in its original form 
(usually for band) but also in a wide array of arrangements — for everything from 

Table 19.1 Number of Musicians and Teachers of Music in the United States, 


1890-1910 


Year 

( 7 . 5 . Population 

Musicians and Music Teachers 

Ratio 

1890 

62,622,250 

62,155 

1 per 1008 

1900 

75 > 994>575 

92,174 

1 per 824 

1910 

91,972,266 

! 39 > 3 10 

1 per 660 
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banjo to piano to zither — that could be played at home by amateur musicians 
(Warfield 2009, 449). Sousa therefore had good reason to oppose anything he 
thought might discourage the amateur musician and thus slow his income stream. 
However, instead of proposing actions that would directly protect amateurs — if 
such a thing were possible — he argued that mechanical music should essentially be 
taxed, so that any time a phonograph disc or player piano roll was made, the appro- 
priate copyright holder would be properly compensated. The timing of Sousa’s 
1906 article was no accident. The U.S. Congress was debating new copyright legisla- 
tion at the time, and his essay was part of a campaign to win over public opinión for 
copyright reform, in particular the expansión of copyright to inelude mechanical 
reproductions of music. Sousa’s efforts — which included not only his 1906 article 
but also his testimony before Congress — paid off, and the landmark 1909 U.S. 
Copyright Act included provisions that generated income for Sousa and other 
music copyright holders (Warfield 2009). 

The amateur was therefore essentially a red herring in Sousa’s article, for the 
copyright reform he advocated was not aimed to protect them, ñor was it aimed to 
slow the production of rolls or records. Given Sousa’s prominence in early twenti- 
eth-century American culture and the widespread discussion that his article gener- 
ated, it is fair to say that he, more than anyone else, set the terms of the debate about 
the valué and influence of sound-reproducing technologies. The terms he set were 
simple: The player piano and the phonograph were either bad or good for music. In 
surveying the literature of the time, however, it is clear that Sousa was not among 
the majority. Many commentators, among them prominent musicians and educa- 
tors, embraced these technologies as a means to elevate musical culture in the United 
States (Katz 2010, 56-79). Nevertheless, whether later observers agreed or disagreed, 
they tended to defend one of two binaries — good or bad — and given how long the 
debate has been raging, it is now difficult to avoid choosing one or the other. Indeed, 
I offer evidence of possibly salutary effeets of sound technologies on amateur music 
making. The purpose of this chapter, however, is not to defend mechanical music 
against its detractors. Rather, through the following case studies, my aim is under- 
stand how amateur musicians have responded to these technologies and thus gauge 
how amateurism has been transformed by — and has transformed — some of the 
most culturally significant and pervasive technologies of the past century. 


Case Study 1: Making Music 
with Machines in the Early 
Twentieth Century 


As Trevor Pinch and Karin Bijsterveld have suggested, we may fruitfully understand 
the influence of music technologies by observing how they breach longstanding 
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conventions when first broadly introduced (Pinch and Bijsterveld 2003). Perhaps 
the central convention that the earliest sound technologies breached was that 
music had always been understood as a quintessentially human activity. Although 
mechanical instruments (music boxes, musical autómata, and so on) had existed 
for centuries, these had not been seen as a serious threat to traditional musical cul- 
ture or commerce. However, with player pianos and phonographs, it became pos- 
sible to envision a future in which much of the music heard on a daily basis was 
made not by humans but by machines. Moreover, the phonograph in particular 
introduced something unprecedented in the history of music technology: the abil- 
ity to reproduce the human voice accurately. “Encountering a machine with one 
attribute so singularly human,” Lisa Gitelman writes in her study of the early 
phonograph, was for many an unsettling, even shocking experience. This shock, 
she explains, had “unequivocal implications for the mutual construction of the cat- 
egories Man and Machine since machines became more human when the phono- 
graph-cyborg breathed its first words” (Gitelman 2004, 332). How, then, did 
amateur musicians respond to this breach, this blurring of the traditional boundar- 
ies between agents and artifacts? Some might well have stopped making music 
while others might have continued their musical practices as before, unaffected 
by the availability of these player pianos and phonographs. But many others engaged 
with these technologies in musical ways. I see three broad categories of such engage- 
ment: treating the technologies as quasi-musical instruments, making sound record- 
ings of amateur performances, and performing live music alongside recordings. 

Owners of player pianos were not necessarily passive consumers of the music 
and often acted as what we might cali co-performers. Many instruments either 
allowed or required users to control their operation as the music played. Pumping 
pedáis or sliding levers, users could manipúlate tempo, volume, and accentuation 
throughout the performance. As Timothy Taylor points out, the proliferation of 
these instruments gave rise to an “industry of how-to guides, player piano teachers, 
and other modes of instruction” (Taylor 2007, 285). The manufacturers touted this 
interactivity as an opportunity to experience music in a new way. An advertisement 
from the Aeolian Company proclaimed, “The Pianola thus is found to be a plea- 
sure-giving instrument for all . . . enabling them to play on the piano with absolute 
correctness and with human feeling” (Roehl 1973, 7). A 1925 ad for the Gulbransen 
Registering Piano included this motto: “The Biggest Thrill in Music is playing it 
Yourself” while the makers of the Angelus Player Piano boasted that their instru- 
ments were equipped with the “ Melodant , the famous Phrasing Lever , and other 
expression devices” (Roehl 1973, 35). In another form of co-performance that 
prefigured karaoke, many piano rolls were intended to accompany singing and 
included lyrics printed on the side of the rolls. These “word rolls” thus contained 
only half of the music and were completed only when their owners sang along. 1 
Further evidence of active engagement comes in the form of the many articles 
and books intended to teach owners of player pianos how to play their instru- 
ments. These inelude The Pianolist: A Guide for Pianola Players by Gustav Kobbé, 
Ernest Newman’s The Player-Piano and its Music , and Sidney Grew’s The Art ofthe 
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Player-Piano: A Text-book for Student and Teacher. (Kobbé, 1907; Newman, 1920; 
Grew, 1922). As these books make clear, the player piano had its own idiomatic 
performance practice, one quite distinct from that of the traditional piano, some- 
times referred to in these guides as the £C hand piano.” 

Some phonographs allowed a modest form of co-performance as well. Aeolian’s 
Vocalion model carne with a device dubbed the Graduóla. It was simply a cable 
with a knob attached that, when pushed or pulled, operated a shutter that muffled 
and unmuffled the phonograph’s sound. Using the Graduóla created only a mildly 
noticeable alteration in volume and timbre, but its modest effect did not stop 
Aeolian from trumpeting it as an “epoch-making achievement” in a 1915 brochure. 
Aeolian offered this retrospective look at the technological triumph of the graduóla. 
“This was a device for artistically and effectually controlling tone volume. It not 
only added materially to the efficiency of the phonograph in its accepted field, but 
also greatly increased the artistic possibilities — it made of the phonograph a practi- 
cal instrument for personal musical expression — a musical instrument in the truest 
sense ofthe term ! (Aeolian Company 1915, [7] ). Not to be outdone, Brunswick carne 
up with a rival mechanism. A two-page 1916 ad in the Saturday Evening Post 
announced that a “group of experts was assigned to give human control, so that one 
might render any selection with his own interpretation, giving high lights or shad- 
ows to the music at will. As a result, the Brunswick phonograph has a perfected 
regulator which is a joy everlasting to those who wish to c play’ a phonograph” 
( Saturday Evening Post, October 16, 1916, 50-51). 

One will note that much of the evidence presented here of co-performance 
comes from advertisements. It is difficult to find precise information on how player 
piano and phonograph owners used or thought of these devices, and it is reasonable 
to take a skeptical view of all this breathless advertising copy. Nevertheless, we can 
conclude from these advertisements further evidence of a technological breach. 
Given that traditional instruments require human interaction, we can see why the 
mechanical music industry would go to such lengths to assure consumers that their 
products were not soulless autómata but real instruments that carne alive only with 
a human touch. The industry thus had to strike a fine balance: The devices had to 
be easy to opérate so as not to discourage users but also had to be sufficiently inter- 
active to offer a rewarding experience. As scholar Brian Dolan writes, player pianos 
“helped people engage with music . . . [b]y inviting middle-class consumers to 
effortlessly participate in, rather than passively listen to, musical performance 
(Dolan 2009, xviii). Put another way, as Pinch and Bijsterveld suggest, “The blend- 
ing of personal achievement (the loss of which had been feared by opponents of 
mechanical instruments) with democratized leisure (which had been seen as an 
advantage by proponents of mechanical instruments) in effect helped create a 
market for the player piano” (Pinch and Bijsterveld 2003, 543). 

In addition to the type of co-performance described earlier, some phonograph 
owners treated recordings not as finished musical products but as raw materials to 
be combined and altered to produce novel sonic effects. Phonograph scholar Patrick 
Feaster has discovered a number of examples in his research. For instance, in 1906 
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the members of a Chicago women’s club made recordings of their pet dogs barking 
and then, in a Dadaesque performance, played six machines simultaneously to 
create a kind of canine symphony. Around the same time, other home sound engi- 
neers were creating phonographic montages by recording several cylinders simulta- 
neously or sequentially onto a single recording, while others affixed two halves of 
different cylinders together to create amusing juxtapositions. Such home experi- 
ments predated splicing, overdubbing, and mash-ups by decades (examples cited in 
Feaster 2007, 660-61). 

These kinds of quasi-performative and quasi-compositional interactions were 
not the most pervasive ways in which users engaged with early sound technologies. 
More common was the second type of activity cited earlier: using the phonograph 
to record amateur performances. Whereas making piano rolls required a special 
kind of expertise and equipment, it was quite feasible for phonograph owners to 
make recordings at home. The earliest machines, specifically ones that used cylin- 
ders, were capable not only of playback but of recording as well. The phonograph 
industry, perhaps trying to cióse a breach, encouraged consumers to make their 
own recordings. Thomas Edison’s company, for example, sponsored a home 
recording contest in 1907 (Feaster 2007, 442n66). While such gatherings might seem 
fanciful, countless amateur cylinders were made. One such recording was made by 
A. H. Mendenhall of Pomeroy, Washington, who, sometime early in the twentieth 
century, recorded a phonographic “letter” to send to his friend Guy Willebrand, 
who lived nearby. After some initial pleasantries he launches into a comic song 
about two Irish bricklayers, then clears his throat and proceeds with a jaunty 
harmónica solo; he cío ses by wishing his friend a Merry Christmas and a Happy 
New Year. Or consider the recording made by the Tindle family of New York City. 
Around 1910 they gathered around the family talking machine and recorded 
a medley of three popular songs from 1909. The singing is jovial and far from 
polished and is occasionally interrupted by good-natured chatter. 2 

Home recording declined with the standardizaron of the disc-playing machine, 
which was designed for reproduction, not playback. (It was not until tape recorders 
became available in the 1950S that home recording became popular again.) Yet live 
and recorded music continued to remain partners, and some phonograph owners 
would sing or play along with recordings. As an amateur violinist explained in 
response to a 1921 Thomas A. Edison, Inc., questionnaire, “I often learn how to 
interpret a piece by listening to Mr. [Albert] Spalding play it on the Edison — then 
I play it along with him (Katz 2010, 78). An amateur English cellist wrote in 
Gramophone magazine in 1925 that he found playing along with recordings prefer- 
able to the kind of “anaemic amateur orchestral society” to which he had once 
belonged (Luetchford 1925, 74). In Germany, Hausmusik, or amateur music making 
at home, may have benefited from the presence of records early in the twentieth 
century. “For all the complaints about its supposed displacement by records,” his- 
torian Corey Ross writes, “it seems that Hausmusik was becoming more, not less, 
popular after the First World War. Indeed, records arguably aided the revival of 
Hausmusik” (Ross 2008, 51). One way this was possible was through specially made 
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records (dubbed “Spiel mit” [Play with] dises) that featured professional musicians 
playing duets or quartets with one part missing, which would be supplied by 
the amateur playing along at home (Gerigk 1937; Ross 2008). (The sheet music for 
the missing part was included with the dises.) Similar recordings later became pop- 
ular in the United States and elsewhere, produced by companies such as Music 
Minus One, Inc., and continued to be sold into the twenty-first century (www. 
musicminusone.com). 

Many music teachers carne out in favor of the phonograph as a means of engag- 
ing their students. In a 1916 forum in the American music journal Etude , J. Lawrence 
Erb argued that “the total effect of mechanical players has been to increase interest 
in music and stimulate a desire to make music on one’s own account” (“Effect of 
Mechanical Instruments upon Musical Education” 1916, 483; see also Cecil 1903, 
482). The teacher F. W. Wodell explained in the same article that he had “estab- 
lished a system whereby records are made by students at regular intervals, of both 
exercises and pieces, and reproduced for critical hearing and comparison by the 
pupil” (“Effect of Mechanical Instruments” 1916, 484). Oscar Saenger published a 
course of vocal study in which the student listened to and then imitated various 
exercises on several specially made dises; Hazel Kinscella published a similar method 
for piano (Saenger 1916; Kinscella 1924). 

Clearly, the amateur did not disappear in the early days of mechanical music. 
For many music lovers, the player piano and the phonograph did not deaden the 
urge to perform; quite the opposite. Amateurs entered into a variety of partnerships 
with these technologies, whether they treated music machines quasi-instrumen- 
tally, used phonographs to record themselves, or played along at home with them. 
These new relationships helped reconfigure musical culture. Hybrid types of musi- 
cal performance emerged, as did new ways of experiencing music and understand- 
ing its role in everyday life. 


Case Study 2: Karaoke 


In Japan, where karaoke first emerged in the early 1970S, impromptu amateur sing- 
ing had long been accepted and even expected at certain kinds of social gatherings. 
Karaoke developed out of this tradition, and karaoke technology aided — and inev- 
itably influenced — the practice. The earliest machines used cassette tapes but were 
similar to disc-playing jukeboxes in that they played music when coins were inserted. 
However, unlike jukeboxes, and more like the “Spiel mit” or “minus one” record- 
ings mentioned earlier, karaoke machines were designed and intended to accom- 
pany Uve performance. The crucial feature of any karaoke technology — whether it 
uses cassettes, records, CDs, láser dises, MP3S, or some other format — is that the 
recordings it plays are not complete in themselves. The term karaoke , which 
can roughly be translated as “empty orchestra,” points to this incompleteness. 
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The recordings provide instrumental backing for singers and are almost never 
listened to unaccompanied. Simply insofar as karaoke demands active music par- 
ticipados it provides a counterexample to the argument that recording technology 
necessarily encourages a passive relationship with music. Those who interact with 
karaoke technology are anything but passive, and it could be said that any given 
group of people listening to karaoke is less an audience than a collection of 
performers-in-waiting. While there are professional karaoke singers and teachers, it 
remains an almost exclusively amateur phenomenon. When money changes hands, 
it is the performers who pay and the owners of the karaoke machines and venues 
who make money from those who sing. 

Karaoke became hugely popular first in Japan and then in the rest of Asia and 
the world. In Japan in 1995, Hiroshi Ogawa reported, “55.4 per cent of Japanese over 
the age of 15 experienced karaoke [and] people who used karaoke systems did so 
an average of 10.5 times per year” (Mitsui and Hosokawa 1998, 47). On any given 
week in the late twentieth century, then, tens of millions of Japanese amateurs were 
singing into a microphone. Around the globe, many more millions have done the 
same. 

The question then arises of what impact these legions of songbirds may have on 
musical practices. I would point to two main areas of influence: the music industry 
and song composition. Given the huge amounts of money spent on karaoke, the 
humble amateur singer wields great power in the marketplace of music. Ogawa 
writes, “At the start of the 1980S only a few recorded singles had registered a million 
sales in any given year. But since 1991 there have been more than ten million-sellers 
of CD singles per year. . . . The reason for this increase,” he argües, a is karaoke.” In 
other words, great demand — in the form of millions of karaoke singers gravitating 
toward a handful of songs — begat great supply in the form of millions of copies of 
certain popular songs being sold in Japan. This demand has also driven the produc- 
tion of specific types of songs, whether duets for women, disco songs, or enka 
(a Japanese popular music genre) — all of which witnessed a surge in the numbers 
of songs and hits in response to karaoke (Mitsui and Hosokawa 1998, 49-50). Of 
course, catering to amateur musicians is nothing new. Countless instrumental 
composers and songwriters of the early twentieth century and before either wrote 
or arranged music with the capabilities of nonprofessionals in mind. (John Philip 
Sousa, as cited earlier, is a perfect example.) What is different is that this market for 
amateur performance owes its very existence to the kind of sound-reproducing 
technology that some feared would render the amateur extinct. 

Amateur karaoke singers do not merely affect supply and demand but also 
influence the very sound of the music they sing and the compositional practices of 
songwriters. As Ogawa points out, because of karaoke, a one of the important ele- 
ments for hit songs changed from £ good to listen to’ to £ good to sing’ ” (Mitsui and 
Hosokawa 1998, 49). That is, songwriters have to think about not only the profes- 
sional singers who will record their music but also the abilities of the amateur musi- 
cians who will replace those professionals in their local bars, coffee shops, or karaoke 
boxes. (A karaoke box is a room rented out to prívate groups for karaoke singing; 




468 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


it is prevalent in commercial areas throughout Japan, Korea, China, and elsewhere 
in Asia.) In the music genre known in Hong Kong as Cantopop, songs became less 
complex as karaoke versions of them became more popular. Cantopop composers 
started writing songs with narrower melodic ranges, fewer difficult leaps, and 
shorter phrases in order to accommodate the amateur singers’ typically modest 
abilities . 3 Similarly, Shinobu Oku has suggested that karaoke has exerted a conser- 
vative influence on enka in Japan: “Because of the strategy of matching the songs to 
fans’ singing ability, new singing techniques and unfamiliar musical elements are 
generally risky and unwelcome. Therefore, many traditional musical elements have 
remained in enka ’ (Mitsui and Hosokawa 1998, 71). In Hong Kong, karaoke trans- 
formed a classical genre into a popular genre. Karaoke brought Cantónese opera, 
once a highbrow traditional form into homes, clubs, and restaurants and made 
everyone a potential opera singer. As Patrick Ho, Hong Kong’s secretary for home 
affairs, explained in 1995, “Nowadays Cantónese opera lovers are as likely to organ- 
ise their own recitáis at home as they are to line up for tickets to the next staged 
performance. From an art form designed to be watched and appreciated, Cantónese 
opera has evolved into one that invites participation” (Ho 2005). These are pro- 
found, even definitional changes occurring in some of the world’s most popular 
music genres, and they have been spurred by karaoke. 

In 1984 ethnomusicologist Charles Keil wrote about his encounters with what 
he called “mediated-and-live musical performances” in Japan, one of which was 
karaoke (Keil 1984). Keil made several perceptive observations that are relevant 
here, for they offer ways of understanding the relationship between live (especially 
amateur) music making and sound-reproducing technologies. Many of the perfor- 
mances and rituals he attended combined traditional vocal or instrumental perfor- 
mance with the playback of recordings; as an American with preconceived notions 
about how one interacts with music technologies, he was surprised that he detected 
a no visible friction” between those who sang or performed live and those who oper- 
ated record players or cassettes. “The scenes I witnessed seemed very traditional . . . 
the atmosphere suggested, c this is the way it is always done 5 ” (Keil 1984, 93). Keil 
also concluded that the interactions between live and mediated music that he 
observed did not deaden the human musical spirit but quite the opposite: “What is 
striking to me in all these instances of mediated-and-live musical performances is 
first of all the humanizing, or better still, personalization of mechanical processes” 
(Keil 1984, 94). Particularly in the case of karaoke, he saw the practice as “an asser- 
tion of individualism, skill, and personal competence before others in a demanding 
situation where any slip or hesitation will not be compensated for by the ‘empty 
orchestra’ ” (Keil 1984, 95). 

Finally, Keil realized that what he witnessed was not as foreign to him as he had 
first thought. Turning his scholarly gaze inward he asked, “Why haven’t I paid any 
ethnomusicological attention to the fact that since I was a teenager I have spent 
more time playing bass or drums to records than I have playing with other people?” 
(Keil 1984, 95). The answer is in part that, at least before karaoke became popular 
in America in the 1980S, there had been no single widespread phenomenon like 




THE AMATEUR IN THE AGE OF MECHANICAL MUSIC 


469 


karaoke in which amateurs performed alongside recordings; the answer may also be 
attributed to the idea, reinforced since Sonsas time, that the live and the recorded 
could not easily coexist. But they could and they did. In the United States, people 
had long made music with recording technology — they had sung, played, and con- 
ducted alongside machines and had done so frictionlessly, just like their later 
Japanese counterparts. 


Case Study 3: Hip-Hop 


In the early 1970S, not long after the first karaoke machines were being marketed in 
Japan, a discovery in the New York City borough of the Bronx led to the develop- 
ment of another globally influential, live-and-mediated — and, to a great extent, 
amateur — musical practice. Young disc jockeys spinning at parties noticed that 
dancers were especially energized by what were called breaks, the short percussion 
solos found on many funk, soul, and rock records. In itself, this discovery was not 
particularly significant; as is true of most discoveries, its real impact carne from its 
practical application. Rather than simply let the breaks come and go in a matter 
of seconds, as was typical on most records, the DJs intervened. Through a great 
deal of trial and error, DJs found that they could extend the breaks almost indefi- 
nitely. At first, they would simply pick up the needle and return it to the beginning 
of the break, repeating it as desired. (This technique, developed by a DJ known 
as GrandWizzard Theodore, was called the needle drop.) Later, Grandmaster 
Flash and others developed more complex ways of repeating breaks, using two 
copies of a record on two turntables and employing a piece of equipment called 
a mixer to switch quickly and seamlessly between the two dises (Flash and Ritz 
2008, 75-81). 

Ironically, DJs extended the breaks by harnessing the latent powers of the very 
technology that demanded such brief percussion solos in the first place. It is impor- 
tant to understand that the short four- or eight-bar break is native to sound record- 
ings; they are common on disc because of their often severe time limitation. It was 
the fundamental insight of the hip-hop DJ that the turntable itself contained the 
key to overcoming its procrustean nature. The response to this technological inter- 
vention was swiít and pro found. A new form of virtuosic solo dancing aróse around 
the repetition of the breaks, one that has continued to evolve in the decades since. 
Those who danced to the breaks called themselves b-boys and b-girls, and their art 
later carne to be known throughout the world as break dancing. However, DJs did 
more than simply extend breaks. They also transformed the sound of prerecorded 
dises in a more radical way — through scratching. Scratching, introduced in the 
mid-i970s by GrandWizzard Theodore, involved manually pushing and pulling a 
record back and forth underneath the Stylus. The result was a percussive rasping 
that no longer resembled the original recorded sound. 
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For the most part, in the decades before hip-hop, records were simply played 
from beginning to end; with few exceptions, little thought had been given to any 
other way of interacting with recordings (for exceptions, see Katz 2010, 109-123). By 
the 1970S, the phonograph had existed for nearly a century; the period of its greatest 
interpretive flexibility would seem to have long been closed. It was a stabilized tech- 
nology, one deeply embedded within established social practices. Then DJs destabi- 
lized the phonograph by violating a taboo as oíd as the phonograph against 
inappropriately touching records. For many (then and now), to touch a record is to 
defile it. One properly holds a disc by the edges, the fingertips making only the 
slightest contact with the vinyl. Yet the fundamental techniques these DJs devel- 
oped — mixing and scratching — are based on a simple dictum: To best control the 
sound, it is necessary to touch the records. As Grandmaster Flash points out, “This 
was a major no-no. You never touched the record with your fingers.” Yet he realized 
that the best way to manipúlate a break was to put “your greasy fingertips on the 
record” (Flash and Ritz 2008, 76, 79). DJs like Flash and others refused to accept the 
“proper” use of the phonograph and made an art out of treating the turntable not 
as a means for the passive consumption of music but for the active creation of 
music. In other words, they transformed the turntable into a musical instrument. 

Without the DJs' manipulation of records, hip-hop, broadly speaking, could 
not have flourished. We must understand that in 1973 — now generally accepted as 
the year of hip-hop’s birth — there was no sepárate musical category actually called 
hip-hop. It was not yet a distinctive genre but more a performance practice, a way 
of approaching other types of music. The performer was the DJ, and the practice 
was to isolate and repeat choice instrumental parts of popular songs — the breaks — 
at dance parties. Later, vocal parts were added over the top of these repeated breaks 
in the form of rapping. Eventually, what began as a practice became its own genre, 
one that combined dance with instrumental and vocal music. 

Rapping, like DJing, is a form of live-and-mediated musical practice, though 
instead of manipulating recording technology to create new music, it aróse as a way 
of accompanying turntable manipulation. Rapping, however, did not begin as a 
musical practice. Rappers were originally called MCs, and in fact that remains the 
preferred term. Short for “master of ceremonies,” MC seems like an odd way to 
describe a vocalist, for the term usually conjures up not a musician but a host or an 
announcer. Standing before a microphone, an MC — in the traditional sense — 
might welcome an audience to an event, introduce acts at a variety show, or make 
announcements. At the earliest hip-hop parties, that is exactly what MCs did, and 
that is how they got their ñame. Even from the beginning, however, MCs did more 
than just make announcements. The more boisterous ones shouted out to the audi- 
ence, trying to help their DJ by stoking the energy in the room. Some developed 
catchphrases: Kid Creóle was known for calling out variations on “Yes yes y’all, and 
you don’t stop, to the beat y’all and you don’t stop.” Cowboy was probably the first 
to command dancers to “Throw your hands in the air and wave em like you just 
don’t care” (Fricke and Ahearn 2002, 209). And like good catchphrases, they caught 
on and became part of the hip-hop vocabulary. Then MCs started to do more, 
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chanting brief rhymes to the crowd, and gradually their rhymes grew — from cou- 
plets to stanzas to nearly epic-length poems. Their numbers expanded as well: Often 
five rappers would stand before the crowd at a time, and crews of MCs — though 
still associated with a DJ — proliferated. A line had been crossed: Announcers 
became artists and no longer assisted the DJ but commanded equal, if not more, 
attention from the crowd. 

The pioneering DJs and MCs began as amateurs and developed their art in the 
bedrooms and basements, as well as the streets and playgrounds, of the Bronx. 
Although some went on to generate an income from their work, most did not. 
More to the point, it seems unlikely that hip-hop would ever have been developed 
by professional disc jockeys and MCs. Mixing and scratching records or chanting 
over the microphone for extended periods contravened long-established practices 
and rules; simply put, such behavior would have been deemed unprofessional. 
What is also significant here is that these amateur musicians typically had not 
already pursued traditional types of instrumental or vocal performance. In other 
words, the birth and flourishing of hip-hop offers a case study of the rise of a new 
class of musicians, one directly tied to a technology long held to discourage, not 
promote, the musical amateur. 


Case Study 4: The Digital Amateur 

Music Video Games and 
Mobile Phone Music 


Since the late 1990S an explosión has occurred in the amount of digitally based 
musical activity designed for or engaged in by amateurs. In video game arcades, on 
home computers, and even on phones, music is being performed and composed on 
a vast scale. Two examples will illuminate some of the issues that this new age of 
musical amateurism raises: music video games and music applications for mobile 
phones. I have picked these in order to illustrate two broad points: that these forms 
of musical activity challenge traditional notions of musical agency and creativity 
and that this amateur music making has become a driving forcé in modern musical 
culture. 


Music Video Games 

In the first decade of the twentieth century one of the hot consumer Ítems in the 
United States and in parts of Europe was a modified musical instrument that 
allowed anyone, regardless of ability, to take part in high-level music making. This 
was the player piano, and the user, by altering tempo or timbre by pumping pedáis 
or sliding levers, contributed, however modestly, to the execution of a polished 
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musical performance. In the first decade of the twentieth-/*rsí century one of the 
hot consumer Ítems was another modified instrument that allowed nonmusicians 
to participate in musical performance. The instrument was not a piano but a guitar. 
Actually, it was a video game controller in the form of a guitar used in the wildly 
popular game Guitar Hero (2005) and its many sequéis. In contrast to the player 
piano, the plástic, guitar-shaped controller does not allow users to modify tempo 
and timbre — these are fixed. Rather, the user Controls — to a certain extent — the 
melody; pressing buttons on the controlleres fretboard in time with moving images 
on a televisión screen activates the pitches in a prerecorded popular song. If the 
player presses the right buttons at the right time, the full and correct melody is 
heard. With both the player piano and the guitar controller, users are given power 
over a limited number of musical parameters. In writing of the player piano Brian 
Dolan calis this phenomenon the de-skilling of musical performance. Certainly 
both player pianos and guitar controllers are easier to play than the instruments 
they are modeled on, but they also require actions not demanded of pianists and 
guitarists — for example, sliding levers and pressing buttons. A distinct type of 
expertise is thus required to master these. We can see this as a reconfiguration of 
skill rather than a simple subtraction of expertise; such an approach allows us to 
understand these technologies on their own terms rather than as mere imitations or 
simulations (Seaver 2010). Put another way, as Andrew Goodwin argües, “new 
technologies have not removed the notion of £ skilP ” (Goodwin 1990 [1988], 262; 
see also Théberge 1997). 

Games like Guitar Hero have excited strong opinions about their legitimacy. 
On one side are those who argüe that these games are a positive forcé in modern 
musical life. Among them, naturally, is Alex Rigopulos, cofounder of Harmonix 
Music Systems, the company that created Guitar Hero and several related games. 
“Everyone comes into the world with this innate desire to make music and almost 
everyone tries to learn an instrument at some point. And the overwhelming major- 
ity of these people quit after a few months or a few years because it is just too damn 
difficult. They spend the rest of their lives loving music . . . but not having any outlet 
for that innate urge they feel” (quoted in Radosh 2009). Rigopulos touts both the 
de-skilling and democratizing effects of the games and suggests that these technolo- 
gies remove the barriers that have been keeping so many firom expressing their 
innate musicality. Once again, we see parallels with those who promoted player 
pianos. “Before the Pianola carne,” explains an early twentieth-century Aeolian ad, 
“how very few there were who even caught a glimpse into the grand world of har- 
mony.” However, with the Pianola, “The world of music is free to all” (Roehl 1973, 
7). Furthermore, just as vehemently as Sousa and others condemned early forms of 
sound technologies, critics of the games dismiss their musical valué. “Guitar Hero,” 
argued professional rock guitarist John Mayer, “was devised to bring the guitar- 
playing experience to the masses without having to put anything into it” (quoted in 
Miller 2009, 405). 

However, as ethnomusicologist Kiri Miller contends, neither the arguments for 
ñor those against these games illuminate what makes them distinctive. She has 
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coined the term schizophonic performance to describe the act of playing guitar, bass, 
or drum controllers, an act that combines “physical gestures of live musical perfor- 
mance with previously recorded sound” (Miller 2009, 401). Borro wing from 
R. Murray Schafer’s term schizophonia , Miller highlights the split between the live 
and the recorded. Yet she argües that the two are not as distinct as we might think, 
that these music video games occupy a middle ground. On the one hand, operat- 
ing game controllers can be like playing real instruments (though more so for 
experienced players than beginners.) For one, players sense a causal relationship 
between their physical actions and the sounds they hear. Second, players can 
experience music from the inside of the song and feel that they are both part of and 
carried along by the musical texture. Writer Helene Stapinksi describes this sense of 
immersion when she played the drums: “I loved the drums because they were all- 
consuming. They surrounded me . . . When I played the drums, I knew what those 
surfers felt like, inside the tube, the wave breaking over them, but at the same time 
carrying them, faster and faster, toward the shore” (Stapinksi 2005, 12). (For more 
on immersion see Grimshaw, this volume). These are the same type of sensations 
often described by accomplished players of Guitar Hero and similar games. Third, 
game players may develop the ability to focus on and shift attention among 
different parts within an ensemble, a skill required of accomplished musicians. 

At the same time, there are clear differences between gameplay and instrumen- 
tal play. One does not create new sounds but activates prerecorded sounds in a 
tempo and order not of one’s choosing. Depending on the game, players have little 
leeway for spontaneous musical acts, whether through improvised melodies, har- 
monic progressions, or changes in phrasing, accentuation, timbre, or dynamics, all 
of which are hallmarks of live performance. 

As Miller explains, “Playing Guitar Hero and Rock Band isn’t just like playing a 
real instrument, but it’s nothing at all like just listening to music.” “Schizophonic 
performance,” she concludes, “is collaborative performance: the players and audi- 
ences join the game designers and recorded musicians in stitching musical sound 
and performing body back together” (Miller 2009, 424). Whether it is called schizo- 
phonic performance, mediated-and-live musical performances (cf. Charles Keil), 
co-performance, or hybrid performance, the acts performed by the millions who 
play these games represent a growing norm. This type of collaborative amateur 
performance is nothing new, of course, and dates back to the earliest player pianos 
and phonographs. Nonetheless, a point may well have been reached in which the 
majority of amateur music making involves some sort of technological mediation. 


Mobile Phone Music 

In January 2007, Apple, Inc., introduced its multimedia and Internet-capable 
mobile phone, the iPhone. One of its most distinctive features is the App Store, 
a Service that pro vides an ever-growing collection of ff ee and paid third-party appli- 
cations of every imaginable variety (www.apple.com/iphone/apps-for-iphone). 
Hundreds, perhaps thousands, of these apps are music related. Some of them 
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connect to Internet radio stations or identify the lyrics to songs stored on the phone 
(which also doubles as a music player); others serve as electronic tuners or offer 
music games similar to (or in fact versions of) Guitar Hero and its kin. Of particular 
interest here, however, are those applications that allow users to create music. These 
cover a huge range of possibilities. Sound Shaker, for instance, allows children to 
create and mix sounds by tapping the touch-sensitive screen and shaking the phone. 
(The iPhone has an accelerometer that detects movement and spatial orientation.) 
PocketGuitar, iBone, and Filtatron offer guitar, trombone, and synthesizer simula- 
tions. Singing into the phone’s microphone using the app called I Am T-Pain results 
in a digitally altered voice that emulates the rapper T-Pain’s use of the robotic- 
sounding Auto-Tune effect. When surveying user comments (typically collected on 
sites dedicated to each app), it becomes clear that most engage these apps for recre- 
ational purposes and not as a means of earning a living as a performer or composer. 
That is to say, these apps almost exclusively serve amateurs. 

The iPhone was not the first and is not the only mobile phone to offer musical 
applications. It has been, however, one of the most popular and influential such 
devices, and thus offers a valuable case study for exploring mediated amateur music 
making in the digital age. In particular let us consider Ocarina, an app that seem- 
ingly transforms the iPhone into a wind instrument. Ocarina was released by Smule 
in November 2008 and quickly caught on — in its first six months it was down- 
loaded (at a cost of 99 cents in the United States) on more than a million devices 
(http://ocarina.smule.com/). Its popularity was due in part to its novelty and ease 
of use. The player blows into the phone’s microphone and covers various combina- 
tions of four simulated airholes to produce a mellow, pleasant tone with a range of 
an octave and a fourth. It is also possible to switch between different scales and to 
play with vibrato by tilting the phone downward; it thus offers a respectably wide 
variety of pitches and timbres. Ocarina’s popularity can also be traced to Nintendo’s 
multimillion-selling video game Legend ofZelda: Ocarina of Time , released in 1998. 
The game is said to have spawned a craze for the real ocarina, a vessel ilute typically 
in the shape of an elongated egg that was popular among amateurs in the early and 
mid-twentieth century. The residual popularity of the video game (in which an 
ocarina plays a significant role) and the real ocarina seems to have primed the 
market for Smule’s versión of the instrument. 

Smule claims that Ocarina is the “first true musical instrument created for the 
iPhone” (http://ocarina.smule.com/). Whether or not it was the first, it is hard to 
dispute that the iPhone, when Ocarina is engaged, truly is a musical instrument. 
Playing Ocarina is not an act of co-performance, where users control only certain 
nuances while pitch, melody, and rhythm are predetermined. Because most musical 
parameters are under the player’s control, the iPhone Ocarina is closer to a tradi- 
tional ocarina than a Guitar Hero controller is to a guitar; it would be fair to cali 
Smule’s Ocarina simply a type of aerophone, or wind instrument. 

If the iPhone is a musical instrument, it is hardly a simple one. Given that 
Ocarina is just one of many instrumental apps, we might cali the iPhone a multi- 
instrument, for it can be any of a hundred types of music maker. Moreover, unlike 
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most instruments, the iPhone is Internet capable. This means that iPhone musi- 
cians can connect to one another across the globe. Ge Wang, Ocarina’s designer, 
touts this as a particular virtue: “Ocarina is also a type of social instrument that 
enables a different, perhaps even magical, sense of global connectivity” (Wang 2009, 
303). Tap on the globe icón, and an image of the earth floating in space appears. 
Glowing pinpoints indicate the location of Ocarina users as if each one represents a 
city lighting up the night sky. (The sun never seems to rise in Ocarina’s “World 
Listen,” as it is called.) Every few seconds, snippets of music sound, each a passage 
played by an Ocarinist somewhere in the world, the notes ascending heavenward 
from the player’s location (provided by the phone’s GPS) in a glowing column of 
üght. 

As a sampling of the music being played around the globe on the Ocarina, 
consider what I heard using World Listen over the course of a few minutes one 
day late in 2009: “Twinkle, Twinkle, Little Star” (England), the theme from Star 
Wars (France), “The Lion Sleeps Tonight” (Portugal), “Ode to Joy” (Tokyo and 
Vancouver), and “Oh, Susanna” (Malaysia). The halting quality of many of the 
performances attests to the amateurism of the Ocarinists and also imparts a sense 
of authenticity — the music clearly seems to have been played by actual people. One 
can track users and designate them as favorites and communicate with them in one 
of the Ocarina Internet forums (www.smule.com/forum). One need not buy into 
the utopian visión that Ocarina’s promoters push, however, to see at least the 
potential of the Ocarina and, by extensión the iPhone, to connect people through 
music. Indeed, hundreds of videos on YouTube.com show iPhone Ocarinists play- 
ing in small ensembles, including a consort performing Led Zeppelin’s “Stairway to 
Heaven” or Ge Wang’s own Mobile Phone Orchestra (or MoPho) at Stanford 
University. 

In addition to the Internet connectivity of the iPhone-as-instrument, another 
distinctive feature is its portability. The iPhone is essentially a small Computer, 
which makes it not unlike any of countless home devices that also allow users to 
make music. The iPhone’s portability, however, means that millions of people are 
carrying musical instruments in their pockets, making music wherever they please. 
Again, there seems to be the potential for, if not a massive democratization of music 
at this point, then a considerable surge in amateur music making through the 
iPhone. 

At this early date, however, it would be unwise to issue grand pronouncements 
about the musical impact of the iPhone. Although Apple had sold more than sev- 
enty-three million iPhones by the end of its 2010 fiscal year (http://en.wikipedia. 
org/wiki/Iphone; accessed Feb. 18, 2011), relatively few users regularly employ them 
as musical instruments. Moreover, the device is, by nearly any standard of living, 
very expensive. One could easily buy a modest ñute, guitar, or violin — or a few 
dozen plástic ocarinas — for the same price. Moreover, the polished metal and glass 
iPhone is not the sturdiest of music devices and certainly was not designed for long, 
physically intense rehearsal or performance sessions. “It doesn’t feel like an instru- 
ment,” objects Tod Machover, MIT Computer scientist and composer. “ [T]hink of 
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a piano ... I mean, a piano is a machine, but you’ve got ivory, and there’s weight 
behind the keys, you have this really — you feel the resonance in the instrument” 
(Machover 2010). At this point, then, we should recognize the significance of the 
iPhone instrument in terms of the potential it represents. It is not hard to imagine 
a time when cheap, robust, and interconnected devices with the kind of appealing 
tactility of successful traditional instruments are in the hands of the multitudes. 
The unanswerable question, however, is whether the multitudes will make music 
with them. 


CONCLUSION 


The case studies offered here in no way cover the ful! spectrum of live-and-medi- 
ated amateur music making, which has flourished since the late nineteenth century. 
Other examples inelude the once-popular practice of phonograph conducting, in 
which one would “lead” an invisible orchestra, baton in hand, in the privacy of 
one’s home or, in later decades, the phenomenon of the mash-up, in which one 
creates new music by digitally combining elements of two or more different songs 
on a home Computer (for more on the mash-up, see Pinch and Athanasiades, this 
volume; see also Katz 2010, 165-174). Nevertheless, the case studies on the player 
piano and phonograph, karaoke, hip-hop, music video games, and musical mobile 
phone applications illustrate a number of important points about the development 
of musical amateurism in an age of mechanical music. 

The first and most obvious point is that amateur music making has not disap- 
peared. This is significant, given how deeply ingrained the idea of the mutual exclu- 
sivity of musical amateurism and musical technologies has been. Yet it is hardly the 
most consequential conclusión to be drawn from this study. More important is the 
understanding that although amateur music making continúes to thrive, it has not 
done so unchanged. One of the clearest changes — clearly seen in the demographic 
of middle-class Americans — is the transformation of an amateurism dependent on 
sheet music to one dependent on recorded and synthesized sound or, more broadly 
speaking, the transformation of a written musical culture into an oral one. A related 
point is that amateurism changed from a live-only practice to a mediated-and-live 
practice. We must understand, then, that the flourishing of musical amateurism is 
in fact deeply tied to the rise of vario us types of mechanical music. Yet while ama- 
teurs have adapted many of their practices in response to these technologies, one 
thing has not changed: the social, communal aspeets of music making. It is certainly 
possible to engage in mediated-and-live performance in isolation, but this has not 
been the prevalent practice. Both DJs and MCs developed their craft in response to 
the needs of dancers and partygoers; friends, family, neighbors, and colleagues 
gather to sing karaoke or play Guitar Hero> and Ocarina players around the 
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globe connect with one another. It would be fair to say that it is the companionable 
interaction with others, as much as the ability to express oneself through sound, 
that appeals to amateur musicians. Whether through physical interaction or online 
social networking, users have found ways to maintain this crucial aspect of music 
making. 

Another important conclusión is that in the twentieth and twenty-first centu- 
ries technologically mediated amateur performance became a hugely powerful Cre- 
ative and commercial forcé. To take the clearest example, hip-hop, a globally 
influential musical and cultural phenomenon, can trace its roots to the schizo- 
phonic performance of the turntable. Karaoke has exerted tremendous influence as 
well by shaping the form and function of various musical styles, while music video 
games have collectively sold billions of dollars’ worth of merchandise and music 
and have influenced pop music sales (Radosh 2009). New songs and remixes are 
also being composed specifically for phone applications. 

To note the prevalence and power of the musical amateur, however, is not to 
require that we celébrate it. In 1888, when the phonograph was little more than a 
decade oíd, composer Arthur Sullivan witnessed a demonstration of Edison’s 
machine and prophesied what could happen if the technology caught on: “I am 
astonished and somewhat terrified at the results of this evening’s experiment — 
astonished at the wonderful power you have developed, and terrified at the thought 
that so much hideous and bad music may be put on record forever” (Sullivan 1888). 
Sample the infinitude of amateur music preserved on records, cassette tapes, CDs, 
and MP3S and know that Sullivan prophesied correctly. Moreover, with the so- 
called Web 2.0 — the second generation of Internet design, which encourages the 
exponential growth of user-generated content as exemplified by sites like YouTube 
and MySpace — all of this amateur music is accumulating fast. In his 2007 book, 
The Cult ofthe Amateur , Andrew Keen wrote of the development of Web 2.0: “The 
new Internet was about self-made music, not Bob Dylan or the Brandenburg 
Concertos. Audience and author had become one, and we were transforming cul- 
ture into cacophony” (Keen 2007, 14). We may agree or disagree with the last part 
of his statement, but this is beside the point. Whether one views record scratching 
or the alcohol-soaked karaoke versión of “My Way” as a joyous noise or just noise, 
we must realize that, golden or not, the turn of the twenty-first century must be 
reckoned as the age of the amateur. 


NOTES 


1 My thanks to Nicholas Seaver for pointing this out to me. 

2 Both recordings discussed here have been reissued on Vm Making You a Record: 
Home and Amateur Recordings on Wax Cylinder, 1902-1920 , Phonozoic compact disc 001. 

3 This observation comes from an unpublished paper by composer and Hong Kong 
native Angel Lam, Dec. 6, 2005. 
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ONLINE MUSIC 
SITES AS SONIC 
SOCIOTECHNICAL 
COMMUNITIES: 
IDENTITY, 
REPUTATION, AND 
TECHNOLOGY AT 
ACIDPLANET.COM 


TREVOR PINCH AND 
KATHERINE ATHANASIADES 


Introduction 


This chapter focuses upon one form of digital music making — that created and 
shared among a group of Internet users. Websites dedicated to this new form of 
musical production and consumption are generally part of what has become known 
as Web 2.0. Most Web 2.0 sites, such as Facebook and MySpace, are social network- 
ing sites and may inelude the uploading and downloading of music as a feature. 
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Here we write about a website, http://www.ACIDplanet.com (henceforth known as 
“ACIDplanet”), which is dedicated to the production of music made by its users, 
who share their songs, review each other’s creations, and enter special remix and 
mash-up competitions promoted by the website. It is one of the largest such web- 
sites, and much of the music is made by a special software program sold by Sony 
(who also owns the website) known as ACID. This software turns a Computer into 
a digital recording studio and enables users to make music by downloading repeti- 
tive samples of music, known as “loops.” The chapter is not an exhaustive review of 
Online music; rather, it presents an ethnographic study (conducted 2005-2007) of 
the working of this one online music site. 1 

We are particularly concerned in this chapter with three themes: (1) how repu- 
tations are achieved in this online world, (2) how musical identities are negotiated, 
and (3) how much, if anything, there is in common between the online and offline 
world of musical identities and reputations. On the first theme we show the impor- 
tance of a form of “transduction,” whereby the sounds and music these musicians 
make are rendered into words and symbols that enable rankings to be carried out. 
On the other two themes, we show that, despite the radical possibilities that the 
online musical world ofifers for, say renegotiating musical identities, many standard 
(offline) practices and identities are actually reinforced at the website. 


ACIDplanet’s History 


The independent software company Sonic Foundry of Madison, Wisconsin, initi- 
ated ACIDplanet. In 1998 Sonic Foundry launched the first versión of ACID soft- 
ware. The program uses dropdown menus to facilítate the building up of complex, 
multitracked compositions by recording numerous individual tracks in real time. 
The user can modify or add sampled sounds or sounds from real instruments to 
any individual track, thus turning a Computer into a complete recording studio. 
Based on the new opportunities that this software afforded music hobbyists, a sepá- 
rate Madison-based firm developed the first versión of the website (named 
ACIDplanet for the first time), which Sonic Foundry brought in-house and launched 
in 1999. Dave Hollinden, who was ACIDplanet 5 s webmaster from 1999 to 2005, 
explains: 

People are able to make music that didn’t have any way to do it in the past . . . 
even if . . . they don’t really have a musical, physical nature, you know as far as 
playing an instrument or something . . . it gives people a chance to . . . have that 
feeling of making music and getting it in front of people and getting a pat on the 
back, and patting other people on the back, and making ffiends, and ... so on . 2 

The original purpose of the website (created at a time when every new business 
needed one) is not entirely clear. However, over time it has evolved as the primary 
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place to post, download, and review the musical creations of people using ACID 
software. In 2003 Sonic Foundry was bought by Sony for a reported $18 million. 
From the perspective of the ACIDplanet webmasters, Sony's intentions for the site 
are not always clear. Direct sales of Sony software producís are a small but signifi- 
cant part of the website’s business, but the site overall is more important as a pro- 
motional vehicle for this form of music making. 


Our Study 


In our ethnographic work we have made a special effort to lócate, observe, and 
interview thirty-five users of ACIDplanet in the places where they make their music 
and access the website. We have thus spent a lot of time in home studios, as well as 
the Internet cafes where some members of this new breed of musician like to opér- 
ate. We mainly interviewed users who lived cióse to our base in the northeast región 
of the United States. Other interviews were conducted opportunistically in Korea 
and the UK, as well as with Sony personnel. Interviews were held face to face and 
typically lasted between one and a half and two and a half hours. 

Thirty-two of our interviewees were male, twenty-five resided within the United 
States, and thirty were Caucasian. The age range was eighteen to fifty-two. Our 
interviewees had a great variety of experience in terms of musical tastes and back- 
grounds, personal histories, motivations for using the site, and amount of time 
spent on the site. Some obsessively used the site, posted all of their music there, and 
used it as a promotional tool for their own independent CD releases; others were 
casual users who had discovered the site by chance when playing with ACID soft- 
ware and in effect used the site as a place to store their compositions. However, even 
these casual users would alert their friends and family to the existence of their musi- 
cal compositions, which anyone could listen to by visiting the site. 


Using the Site 


ACIDplanet provides for its “citizens” the options to post, listen, review, and obtain 
a chart position for musical compositions, particípate in forums, enter remix and 
mash-up contests, make lists of favorite artists, and more. All of these activities, 
importantly, are free, and the site has no requirements of its users other than to 
follow basic guidelines of etiquette. It makes no promises of fame or fortune, either; 
Sony simply provides the web space and location, as well as the music-creation 
software that encourages many people to get involved in music making in the first 
place. 
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Upon visiting the site, potential users are able to perform a limited number of 
activities, such as listening to other users’ songs (any song can be listened to imme- 
diately without downloading any software). To explore the full range of options, a 
user must sign up for a profile, thus becoming a “citizen” of ACIDplanet. Anyone 
with a valid email address can create a profile, and a user can have up to three 
affiliated profiles attributed to that one email address. 3 If someone has múltiple 
email addresses, that person can create múltiple groups of independent profiles. 
For each profile, citizens choose a ñame they will be known as by other users on the 
website. They can choose to provide information on their profile such as their iden- 
tity, their musical style, influences, and equipment used. It is up to the individual 
citizens to decide what, if anything, they inelude in the ACIDplanet profile. They 
can, therefore, remain as anonymous, fake, honest, or informative as they wish. The 
users’ profile on ACIDplanet becomes their main online identity in that world. 

The predominant activity at the site is the users’ posting of their own music 
(attached to a profile). Each song has its own page, and on this page the user leaves 
a description of the song, including a self-rating from G to R (as with movie rat- 
ings), and a genre (see figure 20.1). The genre is important because it defines the 
ACIDplanet charts on which the song will appear. 

With regard to posting music to the site, we can see one of the major differences 
between ACIDplanet and other MP3 download sites: The material that users post is 
expected to be their original work. ACIDplanet will not condone the posting of any 
music that is copyright infringed. Of course, because of the sheer volume of music 
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Figure 20.1 Genre of Music on ACIDplanet (as of 2007). 
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posted to the site, it is impossible for the webmaster to pólice music for pirated 
tunes all of the time. When citizens post songs to the website, they must agree that 
there has been no copyrighted music used in those songs without permission. If a 
Citizen does post a copyright-infringing song and the posting is discovered by the 
webmaster or reported by another user, then that song will be deleted and that person 
will receive a warning. If a user continúes to viólate the rules and posts copyright- 
infringing songs, that person can be banned from the website permanently — or at 
least until the user reregisters at the site using a different email address . 4 

Users also particípate heavily in the reviewing portion of the site. As soon as a 
user posts a song, other users can listen to it and leave their own comments for the 
original user, as well as a star rating from one to ten. The only formal parameters 
for reviewing are that the review cannot be malicious or obscene, and if a rating of 
fewer than five stars is assigned, then that review must be at least twenty-five words 
long. Reviewing plays a large role in the charts, as the ratings for each song deter- 
mine its chart position and, consequently, the number of people who are exposed 
to the song. Reviewing is an interesting feature in terms of sound studies because it 
forms a “translation node,” whereby the sonic dimensión — sound — is rendered 
into another médium (words) (more on this later). 

Another major feature of the website is the contests that it sponsors. Several of 
our interviewees report being first drawn to the site by discovering these contests. 
Remixing contests are the most common, but sometimes there are also mash-ups 
(where parts from sepárate songs are mixed together to make an entirely new musi- 
cal creation), and contests for original compositions. In remix contests, well-known 
artists dónate one of their songs to ACIDplanet to distribute in the form of sepa- 
rated tracks for ACIDplanet citizens to remix. Entrants can use as many of the 
tracks as they like and add their own interpretation of the song using their skills, 
instruments, and software. At any given time, ACIDplanet may host four or five 
contests, giving users many options regarding type of contest and style of song. 
Winners of these contests generally win hundreds of dollars worth of Sony prod- 
ucís, namely the most up-to-date music software and some loop libraries for future 
songs. Very occasionally major artists such as David Bowie or Madonna will offer 
tracks for remixing or mashing up. The Madonna remix contest held in 2002 (three 
Madonna songs were offered for remixing) produced the biggest spike in trafile 
the website has ever experienced . 5 The prizes to the winners of these exceptional 
contests are often much greater (an Audi car for the David Bowie remix). 

Citizens can customize their experiences on ACIDplanet in a number of other 
ways. Dave Hollinden, former webmaster of the site, explains: “ACIDplanet is 
somewhere . . . where you can just be an absolute hobbyist or be fairly accomplished 
or a power user or a real casual user and still . . . just come and upload, and, you 
might find a place for yourself in the community that way .” 6 Citizens can create 
their own list of favorite ACIDplanet artists and thereby provide a way to find their 
preferred music at the touch of a button. A user can pay to join the ProZone 
(a higher tier of Service), one of whose extra options is to change the color of the 
user’s profile. While this seems a superficial benefit, Citizen PhillyC says, “ [ I ] f you’re 
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flipping through all these pages that all look the same, and you come upon some- 
thing where the colors are different, that immediately catches your eye right there. 
And so you’re gonna pay more attention to that.” 7 

A big draw of ACIDplanet i s the charts. Every piece of music posted is given a 
chart position. The all-genre chart is the most widely acclaimed. Here the top ten 
songs at any time are displayed on the ACIDplanet homepage, but there are charts 
for every genre and subgenre of music. There is also an all-time greats chart. Based 
on how recently a Citizen posted a song and how many stars that song received from 
other members, that song will appear on all of the relevant charts. For example, an 
acoustic rock song will appear on the acoustic rock chart, the rock chart, and the 
all-genre chart — although it probably will not have the same ranking on all three 
charts. It may be number 2 on the acoustic rock chart, number 7 on the rock chart, 
and number 865 on the all-genre chart. The chart position (determined by a secret 
algorithm) depends on how many reviews a song can garner in a short period of 
time, as well as what the competition is like within a given genre. 

Different users place different emphasis on the charts. Some acknowledge that 
getting into the charts is the best way to get a song heard by the largest possible 
group of people. Others explain that the charts are not as important to them as just 
getting their song heard, and still others feel that regardless of how important they 
say the charts are for them, they still get a tremendous ego boost when one of their 
songs makes number one. Says PianoPlayer of the time when his song “Baby, Im 
over the Pain” went to number one for a week on the soft rock charts, “You know 
something? My . . . my head was like, out to here [demonstrates head being very 
large with arms].” 8 We have received numerous emails from users telling us when 
their music “charts” and giving us a link to the song. While the charts mean differ- 
ent things to different people, the fact remains that more people are exposed to the 
music at the top of the charts, so the best way for users to get their songs heard in 
general is to find a way to encourage more people to review them as soon as they are 
posted. As we will see later, this can lead to attempts to “game” the system to ensure 
greater chart success. 


Users 


Who joins ACIDplanet? Learning about ACIDplanet citizens is difficult because, 
although they may choose to disclose information about themselves in their pro- 
files, there is no way of easily verifying such information. Determining which pro- 
file belongs to a person of which gender, ethnicity, or country is not easy. Although 
the subset of 35 from the 300,000 registered users we have interviewed tend to have 
accurate profiles as far as we can ascertain, they may be an atypical sample because 
they agreed to be interviewed. Collecting accurate demographic information from 
the website is almost impossible because of the possibility of fake profiles and 
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because users may have more than one profile. 9 However, we were able to gain an 
understanding of users’ perceptions of the demographic makeup of the site. For 
example, everyone we spoke with believed that there were more men than women. 
The typical user, according to Citizen Johnnybreakbeat, is, C£ you know, [males from] 
late teens to, I’d say, probably early thirties, you know. Guys who sit in their rooms 
and make music. That’s generally what f m finding.” 10 Indeed, our own experiences 
confirm the huge number of male users. Despite explicit attempts to find and 
interview female users and offering an interview with a female researcher who had 
herself worked on gender and music (Athanasiades), we were able to interview only 
three female users for this study. 1 1 

The preponderance of male users of this technology is hardly surprising, given 
the gender inequities found in the world of computers, synthesizers, samplers, and 
the like and in the electronic dance music culture so popular at the site (Wacjman 
1991; Lie and Sorenson 1996; Faulkner 2000; Pinch and Troceo 2002; Rodgers 2010). 
The gendered aspect of ACIDplanet was apparent to the few female users we were 
able to interview. Women who disclosed their gender told us that they are treated 
differently from men by the men on the site. Citizen Hank Mohaski explains that 
there is not only blatantly different treatment of women but sometimes outright 
sexism as well: 12 “If she [a woman] puts a song up, and then you go back and read 
the reviews for that song, you’ll see the stuff percolating up, where it’s anything 
from like, ‘Are you hot?’ . . . ‘What are your measurements?’ All that kind of BS 
[bullshit] stuff that you find in a chat room.” 

Mohaski also noted that reviews for women artists would sometimes read like 
pick-up lines and that there was: 

this out-and-out sexism that, you know, “Oh, you’re a girl, and you suck and . . . 
you know, you’ll never make it,” and . . . then, if those females come into the 
forums, it’s kind of a lot of the same . . . I’ ve known a couple female artists that 
have left the site, specifically because of that. Because either they weren’t taken 
seriously as an artist or because they were basically being sexually harassed . . . 

I’ve heard that the [now defunct] chat rooms could get quite brutal at times 
when there was a female in there. 

Another more humorous example aróse when user Sonic Epiphany told of 
a time when he was flirting with an altérnate identity of one of his male friends 
without even knowing it: 13 

Sonic Epiphany: Oh, he, he creates pseudoidentities like [Spam Fíaters], like he 
has these identities like Jenny . . . McSpamHater. And then he does a song with . . . 
a girl singing, and her, and he just, “It’s Jenny and Trixie! We’re like, uh, you 
know, California fun and tricks.” 

Katherine [interviewer]: Oh ... I saw that [on the site]. 

Sonic Epiphany: That’s him! You thought it was a girl? I was flirting with him for 
a while, and I found out that Jenny McSpamHater was him! 

In this case, Sonic Epiphany (a male user) did not know anything about the 
person or people behind the profile “Spam Haters” except that they said that they 
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were girls. However, they turned out to be fictitious females created by a male friend 
of his. Later we return to the musical advan tages that these múltiple profiles some- 
times give users. The Internet is an interesting place in terms of gender perfor- 
mances; as well as constraining what is possible, it can enable new forms of 
gendering to emerge (Kendall 2002). 

Johnnybreakbeat’s observa tion that most users were under the age of thirty 
does not seem to be true for our sample. Indeed, we have continually been sur- 
prised by the numbers of people in their thirties and forties who have become users 
of ACIDplanet. The reason for this seems to be the convenience of the platform and 
the possibility of making music at home as opposed to the tiring traveling life of a 
working musician. Several of our respondents work from their own home studios 
(often with small children around, who are instructed not to mess with the equip- 
ment), and at the end of the day or night all they have to do to distribute their latest 
musical composition is click to upload to the site. For older people who have spent 
a lifetime in the music biz, working from a laptop at home is a luxury they never 
dreamed of when on the road gigging. 

Most users are based in the United States, where the website is located, although 
browsing through the site brings up users from all over the globe. Indeed, the pre- 
ponderance of U.S.-based users can actually serve to attract global users. For exam- 
ple, one Korean user we interviewed, Chester™, made heavy metal-style songs. He 
derived a big kick from getting his songs positively reviewed by U.S. users because 
the United States is perceived to be the home of heavy metal music . 14 

To give readers a better feel for who comes to ACIDplanet, we next describe 
two users, stephanie K and Jim Spitznagel, in more detail. 


Stefanie K 

Stefanie (figure 20.2) (who spells her profile ñame with a lowercase s) began playing 
the piano at the age of five or six under the direction of her father, who was a clas- 
sical piano teacher. Though he taught her classical music, she says that she “would 
change classical music into rock ‘n’ roll” whenever her father went upstairs. She 
stopped playing when she was fifteen, when her parents divorced and she no longer 
had a piano. She did not return to piano again until she was twenty-five. 



Figure 20.2 Part of stefanie K’s Profile on ACIDplanet (as of 2010). 
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Stefanie became a securities trader but gave up her career when she got married 
and had two children. In 2001 she was inspired to write a song about a friend who 
was killed in the World Trade Center attack. When she performed this song on the 
piano for some friends, they told her that she had some talent, and suddenly 
she found herself pouring out lyrics at all hours of the day. Of one song she said, 
a I wrote the lyrics while we were eating dinner at a restaurant — on a napkin. I write 
lyrics everywhere. I have pieces of paper in every bag I have with lyrics scribbled.” 

Stefanie began to meet other musicians Online through various forums and her 
own website. One of the people she met was Alex Stangl, who was at that time an 
active member of ACIDplanet. Stefanie had never heard of the site, but as soon as 
she began to participate, she was completely drawn in and started actively reviewing 
and collaborating with different members of the website. She was so involved, in 
fact, that she says, ct I was obsessed for a while. My husband got a little pissed off.” 
Stefanie began spending several hours a day on the website, listening to music, 
working on her own music, and reviewing other people’s music while her kids were 
at school and her husband at work. Stefanie was so caught up with the site that she 
had to limit herself. She says, “I used to be on the Computer all night, but I cut back. 
I stopped. . . . I’m pretty minimal now. I have my “Favorite” list, and, you know, 
people that I like, [if they] put up a new song, Til listen to it.” 

Most of the songs that Stefanie posts stem from collaborations. She says that 
many people contact her because they want her to sing on their tracks. Stefanie does 
not use ACID software, as she says that she simply does not have the time to learn 
it. However, she does have a small home studio that consists of a microphone, a 
keyboard, and a mixer. She usually goes into a professional studio to record her 
tracks because she says that her vocals come out better when they are professionally 
done. Generally she will bring a slew of her songs to the studio and have a producer 
decide which are the most promising and which need lyrical or musical work. After 
the session, she brings the songs back to her collaborators for reworking. Either 
Stefanie or one of her collaborators will post the song. 

Stefanie says that her kids — especially her daughter — love her music and that 
all of her kids’ friends listen to it (her daughter took the photograph of Stefanie for 
her profile, which another ACIDplanet Citizen further edited). While her husband 
is not particularly supportive of her hobby (Stefanie subsequently divorced after 
our study ended), Stefanie says that he “has started to accept it more because I put 
my foot down. I’m like . . . this is me! It’s my passion. I found it, and . . . you know 
this is what I wanna do.”) An added bonus is that Stefanie can follow her passion 
from home, which means that she can still be attentive to her kids and the chores. 
Stefanie would love for her songs to get radio play or to be recognized by producers 
and be adopted by mainstream artists. She does not foresee herself actually per- 
forming the songs because she says she does not see herself becoming a woman who 
hangs out and performs in bars. 

Stefanie has been compared to Tori Amos, Chrissie Hynde of the Pretenders, 
Kate Bush, and even Jewel though she disagrees with the last comparison the most. 
The songs she has posted on ACIDplanet that have attracted the most attention are 
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mainly rock and pop, but she has one song she describes as pan-Asian, which she 
sings in Japanese. As of 2007 Stefanie had two song s in the top fifty all-time top 
song s chart on ACIDplanet: “Regrets and Revelations” and “Rock Me.” 15 


Jim Spitznagel 

Jim Spitznagel developed an interested in music when he was a teenager. When he 
was fourteen, his parents got him a guitar for Christmas, and since then he has been 
playing both as a hobby and professionally. He was in a number of bands while 
growing up and worked odd jobs here and there in the meantime. When he was in 
his midtwenties, Jim finally got settled in a cover band, Eddie and the Otters, and 
also owned a well-known record store in Pittsburgh called Jim’s Records. Jim 
became a promoter and ended up working with many big-name bands before they 
broke out5, such as the Pólice, the Pretenders, the Ramones, and U2, to ñame a few. 
Jim’s home studio has many photographs of him posing with famous New Wave 
artists. Eventually Jim retired, sold the record shop, moved to Ithaca, and started 
his own online music store called JIMS, or Jim’s Ithaca Music Shop. He also started 
his own record label, Level Green Recording Company. Jim bought and became 
acquainted with music studio software such as ACID. Though he recorded other 
musicians for a while, he eventually decided to record and produce his own music 
exclusively. Jim has produced several of his own CDs to date and promotes them on 
ACIDplanet. He is a ProZone member, a perk he acquired by virtue of being a beta 
tester for an updated versión of the site. 

Jim has two profiles at ACIDplanet. His main profile, “Spitznagel” (figure 20.3), 
is where he posts his acid-jazz compositions. In 2005 he began posting music he had 
created with his recently purchased Moog Voyager synthesizer under a new ñame, 
“SMX” (which stands for Spitznagel’s Moog Experiments). He created the new pro- 
file so that the reputation of his Spitznagel profile would be unaffected by how well 



Figure 20.3 Part of Jim SpitznagePs Profile on ACIDplanet (as of 2010). 
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or how poorly his new, more experimental compositions were received in the 
ACIDplanet community. Jim, unlike Stefanie, does not particípate in collabora- 
tions simply because he does not have the time. He writes, records, and masters all 
of his own music. He has a very high standard for himself and an excellent working 
knowledge of the software. 

Jim is able to produce high-quality tracks time after time. He sees ACIDplanet, 
first and foremost, as a way of getting more exposure to his music — as he says, “I 
look at it as a marketing tool. And that’s what I always saw it as.” Like Stefanie, Jim’s 
music is good enough that he regularly climbs the charts whenever he posts a new 
song. His goal is “getting people to hear [the song]; hopefully some of those people 
who hear it will wanna buy my CDs.” As well as making music CDs and tracks for 
ACIDplanet, Jim also plays the occasional live gig using his Theremin, a recently 
acquired Tenori-on, and a stack of software synthesizers. He sometimes plays with 
another electronic artist as part of the band, the Beamis Point. In a surprising twist 
for both parties, Jim has recently started a musical collaboration with one of the 
authors of this chapter (Pinch), and they perform as the Electric Golem. 

Contrasting these two citizens illustrates the different ways in which users have 
appropriated the website and its options. Stefanie uses ACIDplanet to test new 
songs, but she also sees it as a final destination for many of her songs. Jim uses 
ACIDplanet as a marketing tool for his CDs because it gives his music increased 
exposure. Stefanie mainly collaborates with other users, while Jim works mainly as 
a solo artist. Stefanie does not use any sort of music-creation software and has pro- 
ducers fine-tune her music; Jim uses music-creation software and masters all of 
his own work. Stefanie is a pop-rock artist, and Jim is a jazz and experimental 
musician. 


New Practices and Terms 


The existence of ACIDplanet as an online place where people can congrégate, create 
their own musical identities, and share their music has led to several interesting 
developments. Citizens have created new terms to describe particular actions or 
events. For example, to be a “chart whore” or to partake in “chart whoring” means 
that a person is trying to climb the charts by using methods that are unacceptable 
to the ACIDplanet community . 16 One way to “chart whore” is for the original post- 
ers to create many fake profiles to review their own song in order to get more stars 
and to climb the charts faster. Sending mass email solicitation for reviews of a par- 
ticular song is another way to chart whore. A more labor-intense way is to write 
meaningless reviews of other users’ songs. Short positive reviews are offered along 
with a ten-star rating. This method (which is remarkably different from the peer 
reviewing practices found in the Sciences) works by the implied pressure on the 
user to return the favor (anonymous reviewing is not allowed) and to review the 
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reviewer’s own latest song. 17 This practice is so widespread at the site that it has its 
own special term, R = R, or Review = Review. 18 Citizen mtkzoa explained how this 
worked to boost one of his songs to an even higher chart position: 19 

Basically, this “Be Revived!” [title of song] thing is the first time — I posted it, and 
I got a couple of hits on it, and . . . by accident I found out I was like in third 
place. So I said, “Well, Im gonna do some reviews and see if I can push it up,” 
and I reviewed a few people, and then word gets around to other people, and they 
start reviewing your stuff, and the next thing you know, I was number one. It’s 
been that way for like five days now. 


Reviewing 


Reviewing is one of the main activities at the site. In order to understand how it 
works we look in more detail at some of the first few reviews received by one of 
stefanie K’s most popular songs, “Rock Me” (181 reviews as of February 2010), and 
Jim SpitznageFs most popular song, “Pharoah’s Corvette” [ sic ] (90 reviews as of 
February 2010). The genres of both songs are very different. “Rock Me” is posted 
under the genre of “Rock,” and “Pharoah’s Corvette” under the genre of “Jazz” and 
the subgenre of “Acid Jazz.” “Rock Me” is a guitar, drum, vocal-based song, which 
allows Stefanie’s voice to shine. “Pharoah’s Corvette” has a much more ambient 
sound and features what sounds like brass instruments, as well as washes of 
electronic sound. The reviews for these songs are typical of the site as a whole in 
that the authors seem to have penned them hastily — almost as part of a stream 
of consciousness. The reviews are frequently ungrammatical and have many spell- 
ing mistakes. They appear to have been written casually as the first reaction on lis- 
tening to the song (or even while it is playing) and are rarely carefully crafted as in 
more formal record reviews. Nevertheless, much detailed feedback can be gleamed 
from them. 

The very first review of “Rock Me,” posted under the heading “Radio Ready,” 
which signáis the reviewers’ opinión of the song’s commercial potential, is as 
follows: 

Very Nice job by all . . . love the blues leads on the right . . . Love the placement of 
the backing vocals and the bull horn FX on the vocals was cool too . . . great lyrics 
too! kudos peace. Skip 

One of the first reviews of “Pharoah’s Corvette,” under the heading “This is 
great,” was actually written by stefanie K (many of the leading artists on ACIDplanet 
review each other’s work): 

Quite intresting [sic] . . . you have certainly put alot of work into this mix ... its 
very busy in certain places ... I love the horn but I think you could have maybe 
cut it back a bit, and than [sic] it would be more dramatic when it appears . . . still 
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trying to figure out why you cali it Pharoahs corvette. cool title, does it refer to 

King Pharoah . . . nice job overall! stefanie K 

Both reviewers note the features of the tracks they “love,” such as the “blues 
lead,” “backing vocals,” “great lyrics,” and “horn.” In the case of “Rock Me” the 
reviewer notes the placement of the “blues leads on the right” and also the 
treatment added to the vocals. 

Other reviews note exact timings in the track. For example, in the case of “Rock 
Me,” one reviewer writes, “dig it at 2:08, very cool . . . this is put together real nice . 

. . gutars [sic] sound great . . . 3:12 is cool too . . . voice sounds great . . . hot track for 
real . . . respect.” Another reviewer is more specific: “Great guitar “solo” around 
1:19. The mini-breakdown at 3:11 is very smooth — great feel.” Reviewers also note in 
a friendly way what they think could be impro ved, such as in Stefanie’s recommen- 
dation to Jim on his use of the horn: “Cut it back a bit” so it “would be more 
dramatic when it appears.” Often ñames of well-known artists, whom the track 
reminds the reviewer of, are mentioned. For instance, JinTs track is described by 
one reviewer as being like the synthesizer-based jazz artist Adham Shaikh. Most 
reviews are very short, and a few are little more than positive endorsements of 
the artist and are explicitly of the R = R type in inviting the artist to check out the 
reviewer’s own music. However, even those reviews that seem to imply, demand, 
or return reciprocation can be full of information. For instance, one of the first 
reviews of Pharoah’s Corvette acknowledges an earlier review Jim wrote of the 
reviewer’s own music but still offers its own detailed comments on what is good 
about JinTs track: “[I] like the robot voices and the cool horn playing excellent 
combinations and great production thnx for the review of £ Beyond the Shores of 
Time 5 ed (AE).” 

Even a short tongue-in-cheek review that might verge on sexism can reveal the 
appeal of the music, as in this early review of Stefanie’s track: “Really starting to 
put a rump in your hump lately (lol) Edgier and heavier . . . and we really like the 
direction your [sic] taking . . . Great work!” 

These reviews are multilayered in the way they work. At the most fundamental 
level they allow the listener to render the sounds into the médium of words. The 
effect of a song on a listener is often expressed in metaphors such as “hump in your 
rump.” However, what is striking about these reviews is how they often mimic the 
sort of language found in recording studios, where the studio engineers, record 
producers, and artists have to discuss what sounds good or bad about the unfolding 
recording. Many of the comments combine the technical terms that recording engi- 
neers use (e.g., FX), technical features of the music (e.g., arpeggio), references to the 
particular sound of well-known artists, and descriptions of the sounds themselves, 
such as being “heavier” or “edgier.” 20 By turning sound into words the reviews 
make it possible for the users to communicate about sound within this community 
of technical practice. They enable the users to know and understand their listeners 
and offer guidance to all as to how to improve their music-making abilities and 
production skills. 
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Reviews are, however, double edged. Not only are they a means of translating 
sounds into words, thereby providing invaluable feedback and the chance to 
improve production techniques and so on, but they also form the basis of a users 
reputation at the website. The number of stars the review receives and subsequent 
chart placement ultimately determine the artist’s success at this website. As well as 
the reviews’ shaping what we might cali the “reputation economy” in this commu- 
nity, the reputation economy also shapes the reviews. For instance, because of the 
norm of reciprocation (R = R), those artists who are successful on ACIDplanet end 
up writing an enormous number of reviews. At the time of writing (February 2010), 
Stefanie had posted 2,566 reviews and Jim 3,541. In principie, reviews can be as long 
as the authors want them to be; the short “stream of consciousness” form most of 
them take follows from the need to write so many. Writing numerous reviews with- 
out repetition can itself be a problem as Jim notes: “If I feel that I’ve written a lot, 
the same thing, F 11 get the dictionary out and change a few words. And I just, say, 
Fll change ‘creative’ to ‘inventive.’ Oh, that’s good! Fll change, you know, c cooF to 
‘groovy.’ Oh, that’s good, you know!” 21 

There is, of course, no common rubric, so to speak, for writing reviews. 
Different reviewers will look for different things and evalúate compositions 
in a variety of ways. Often, when these reviewing rubrics conflict, feelings can be 
hurt and users can get upset. Jim, for example, gives only positive reviews. He 
sees reviewing as encouragement for the reviewed to continué making music and 
does not feel that he is in a position to critique them and their styles. Some people, 
however, feel that a central element of honest reviewing is critiquing other users’ 
music and thus rate and review people’s music accordingly. Nevertheless, the 
reviews at the site are overwhelmingly positive. This again follows from the need 
for reciprocation. As in most reviewing economies, reviewers self-select and 
simply do not offer a review at all if they do not like the music. Even a nine-star 
review as opposed to a ten-star review can be seen as a negative appraisal. 
Very negative reviews, perhaps posted from bogus profiles, are discounted as 
being “drive-by shootings,” written by reviewers who have ulterior motives 
(known as “haters”), such as to damage the reputation and chart placement of a 
rival artist. So again the reputation economy shapes in subtle ways the form the 
reviews take. 

Users often get disillusioned by the reviewing process — there are regularly dis- 
cussions in the forums about the lack of a standard for reviewing (encouragement 
or critique), the high incidence of nines and tens and the negative response when 
negative reviews are given. Hank Mohaski, for example, stopped reviewing a few 
years ago because he was so disheartened by the reviewing process: 22 

I thought the amount of time and effort I was putting into [reviewing], 
uh, I wasn’t getting any return on that investment, so to speak. ... So I’ ve 
become completely disillusioned with that aspect of it. Having said that, I still 
think it’s an important and valuable mechanism to the site. You know? It . . . 
serves its purpose very well for the people who care about it. 
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For most users. reviewing allows them to gain recognition (and reputation) 
incrementally, both in the act of reviewing and in the act of being reviewed. With 
reviewing, reviewers can build a supportive, encouraging, and perhaps technically 
sawy persona and become more likely to have their music exposed when the R = R 
expectation is followed. When a user is reviewed, there is not only the information 
to be gleaned from a detailed review but also the boost to chart placement that a 
good review can give. In addition, recognition is gained since many other users can 
read the review. Sometimes a particular reviewer will gain a following, and this can 
lead to even more exposure for a song. For example, the user :SEG: says that he 
gained popularity among the fans of popular artist MST when MST reviewed :SEG:. 
This is because MST is a widely respected ACIDplanet artist; thus, people respect 
his tastes in other users’ music. 


COLLABORATION 


As we have already noted with Stefanie, ACIDplanet offers new ways for its users to 
collaborate. For example, it has inspired the creation of several new online forums 
that exist solely for members of ACIDplanet to continué their dialogues and music 
creation in a smaller, more prívate setting. One of these forums, the Acid Exchange 
(known as AE), has evolved, according to our respondents, into a place where 
online friends hang out, chat, and create and share music. The forum is a closed 
site: In order to become a part of AE, users must be very active on ACIDplanet, 
display a high degree of technical competence in all aspects of their music, and be 
nominated by one or more people who are already in the forum. These forums are 
not part of the ACIDplanet site itself but provide a space for community building 
among particular groups of users. 

Musical collaboration of a more straightforward nature is also catalyzed by 
ACIDplanet. Users State on their profiles whether they are open to collaborations. 
These collaborations are usually accomplished virtually, as was the case with 
Stefanie’s collaborations. The “virtual band” aspect of ACIDplanet is enticing to 
some users who may recruit or advertise for particular sorts of musicians they 
would like to collaborate with. Sometimes virtual collaborations will turn into 
physically copresent collaborations in which musicians enter each other’s studios 
and make music together or even gig together. Sometimes ACIDplanet users get 
together to attend gigs or trade shows. One user who lived in London told us he 
regularly hosted overseas users in his London apartment. ACIDplanet therefore 
serves as both a destination for users’ music and a site where people are able to 
make connections and expand their music and musical identities. 

The new forms of collaborations that ACIDplanet offers can sometimes lead to 
tensions. Stefanie described one such incident that developed in her collaboration 
on the song “Simple Folks.” Citizen JC Kercheval wrote the instrumental for the 
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song and then gave the song to ACIDplanet artists Mark Moffre and Richard 
Montefiore to develop. He stated that whoever worked the song first could have it. 
Mark asked Stefanie whether she would collaborate with him on it, and they pro- 
duced a new song using JC’s original tracks. However, as soon as they finished it, 
they discovered that Richard Montefiore had already recorded his versión and 
posted it on ACIDplanet. Usually, unless the song is for a remix contest, it is against 
unwritten policy to post a song that someone else has already performed. However, 
because JC had given his tracks to both Richard and Mark and because the two 
songs were somewhat different, Mark and Stefanie decided to post their versión — 
with an explanation of what had happened and encouraging people to listen to 
Richard’s versión as well. In this spontaneous competition, both songs climbed the 
charts — and although Richard told people that it was a friendly competition, 
Stefanie says that he was really angry about the situation. 


Musical Identity 


Online identities and their flexibility have long been seen as key components of the 
Internet (e.g., Turkle 1997). One of the fascinating things about making online 
music is that it allows the musician’s identity to be detached from the physical ere- 
ation of the music in a new way. It also enables new forms of múltiple musical 
identities not easily realizable in any other way. The separation of the original work 
of art from its reproduction (Benjamin 1968) is of course a long-standing feature of 
recorded music (Mowitt 1987). Advances in recording techniques and technologies 
of synthesizers and samplers, including software synthesizers, have further dis- 
tanced the musician from the work of art, but, clearly, there are still musicians who 
are identified as somehow having enacted or realized works of art (Pinch and 
Bijsterveld 2003; Katz 2004). For instance, although not realizable in live perfor- 
mance because of its complex studio techniques, the music of Sergeant Pepper is still 
identified with the ageney of the Beatles (although people might refer to the “fifth 
Beatle,” Sir George Martin, in recognition of his production skills). With online 
music, such as that posted at ACIDplanet, musical identity and ageney are much 
more complicated things: Often all there is is a profile attached to the music. The 
identity of the musicians (if any) and their ageney in creating the music is now often 
solely revealed by the profile, and because musicians on ACIDplanet can have more 
than one profile they can, as we saw with the case of Jim and his experimental Moog 
Voyager music, maintain more than one musical identity. Furthermore, because a 
profile can be easily abandoned and a new one created, this form of online music 
making gives musicians an opportunity endlessly to re-create themselves. 23 

Of course, in that online identities have real-world counterparts, the affordance 
ACIDplanet offers might seem of limited valué. However, what must be borne in 
mind is that many musicians on ACIDplanet release their music only on the 
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website and never perform or meet up, as it were, “live.” The links between profiles 
(if they exist) may of course indícate that it is one and the same person behind the 
profile, but the formal separation of the reviews and the reputations they garner 
means that a sepárate online music identity can be viably pursued. 

One of our respondents, Hank Mohaski, claims to have many different online 
identities. Although he generally operates under what he deems his a main identity” 
of Hank Mohaski, he has at least twenty other profiles. Each of these was created to 
represent a different type of music. For example, his profile “Tape Jazz Massacre” is 
devoted entirely to music that comprises samples of songs and sound bytes he has 
collected over a period of years. Another profile is for the “Zanzíbar Cheap Band,” 
which plays music for elevators and supermarkets. Another example of a user with 
múltiple profiles is SkySwim. He was the creator of “Spam Haters,” the female 
hard-rock group who gained the attention and love of hundreds of users on 
ACIDplanet. 


ACIDplanet as a Community 


The “community” aspect of ACIDplanet, which makes it unique and differentiates it 
from other file-sharing and music-downloading sites, is still evolving. Furthermore, 
the crucial role the users play was realized very early on by the site’s developers. The 
original webmaster told us the following: 

f I] n the end, what we create is a shell that’s filled by the users, you know. . . . 

[Y]ou could launch this same site today empty, and people would come and look 
around like [it was] a big empty house and leave. And the users feel a lot of 
ownership in the site, and sometimes when we talk about bringing in a policy 
they don’t like, they’ll be very quick to say, “You know, we provide all the content 
for this site, you know. Don’t forget that.” 

The webmasters, who are also musicians and users of the site, are very aware of the 
role that users play in shaping the website. 24 They pride themselves on listening to 
users and providing features that allow them to offer feedback and suggestions for 
what they like or do not like and what they would like to see developed in the 
future. Julián Bain, the webmaster (as of 2007), explains how this interaction with 
users has been successful in the past and compares ACIDplanet to a different, failed 
Sony site: 25 

There was another site that Sony had, where they wanted a place for consumers to 
put what they made with Sony producís. And that was Screenblast. But they went 
at it from the point of view of what they, Sony, thought [it] should be. And they 
had a huge staff, and they put together a very nice-looking site. But it didn’t work 
out. And I think a lot of the reason for that, compared to us, who [are] just two 
people basically working on a daily basis, is that it’s very user-driven, for our site, 
and Screenblast didn’t turn out to be as [much of a] user-feedback-oriented site. 
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The users at ACIDplanet not only provide most of the content for the site but 
also form a devoted community who regularly return to the site to post songs, enter 
competitions, post reviews, and in many cases actually care about what is happen- 
ing at the site. Users who quit the site are sometimes bombarded by emails from 
other users to return (and often do return). This community appears to be more 
organic than regular file-sharing systems. 

Part of this process of community formation lies in the special form of medi- 
ated interaction that the website offers. Most social networking websites offer users 
a chance to interact by posting content, having friends, posting links, making lists 
of favorites, and engaging in certain forms of mediated interaction (Boyd 2006). 
The problem all such websites face is that every interaction with the website is in 
actuality discrete. The community can only form around the discreet users interact- 
ing with the traces (in the case of ACIDplanet, the songs and reviews) left at the 
website by other discrete users (Welser 2008). The more users that are involved, 
the greater the possibility that others can monitor these traces and the interactions 
users have with them (in turn producing new traces) — forming in a way an 
“audience” for monitoring the interactions (e.g., a review posted to a piece of 
music). All of this “sociality” takes place even though in principie there may be only 
one user at the website at any one time. The sociality of such a community comes 
from the norms and obligations built up incrementally among discrete users over 
the course of many such interactions. The more such interactions take place and are 
witnessable by many users, the more likely it is that any emerging community 
norms will be reinforced. In short, almost every interaction at the site is a public 
interaction visible (and often audible) to all users. This is unlike most social net- 
working sites, where users can, via privacy settings, restrict content to a selected 
group of users. 

ACIDplanet seems to go beyond most purely social networking sites not only 
by encouraging free access to most parts of the site but also by offering mechanisms 
whereby potentially large numbers of users are directed to particular parts of the 
content that users in turn produce. Special sociotechnical instruments, scopic 
focusing technologies (Knorr Cetina and Grimpe 2008) 26 — such as the charts and 
contests, which generate hierarchically ordered outcomes — allow users to focus 
on certain selected pieces of content (and thereby boost the reputation of the 
pro file associated with that content). Part of this process, as we have seen, involves 
“translation nodes” or “transductions,” whereby the sonic dimensions of the site 
are rendered into other symbolic forms (mainly words and numbers). 

In the wider field of popular music, hit charts (not to forget also the gaming of 
such charts with the payóla scandal, in which radio DJs were paid by record com- 
panies to promote certain records) and contests are very oíd ways of building repu- 
tations and directing consumers of music to particular songs. What the website 
does is to take these conventional resources and rework them in a new way. The key 
thing is that the working of these mechanisms in the online world becomes heavily 
dependent on the activities of users. Activities such as reviewing (which determines 
chart positions) and the judging of song contests by users (some contests award 
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prizes for successful judges as well) 27 do not make these mechanisms any more 
transparent (the exact formula for chart placement, for instance, is a well-guarded 
secret) but do give users involvement and empowerment. This means that reputa- 
tion is social not only in the sense that the outcomes of charts, contests, and the like 
are socially available to all users but also in the way the outcome is reached, which 
is itself dependent upon the users 5 social activities. This “double sociality 55 helps 
turn discrete users into an active and organic community. Because everyone is 
potentially both a producer and a consumer, everyone shares the norms and obliga- 
tions that develop, and there are fewer divisions between Cí fans 55 and “artists” than 
in other musical communities. 


Conclusión: A New Sonic 

SOCIOTECHNICAL ECONOMY OF REPUTATION 


Underpinning key features of the activities at the ACIDplanet website, such as 
reviewing and building musical identities, are the new economies of reputation that 
online technology enables (Masum and Zhang 2004; Resnick et al. 2006; David and 
Pinch 2006). The musical profiles and identities that musicians establish on 
ACIDplanet are subject to a complicated set of sociotechnical processes whereby 
some identities succeed more than others. Some of these features are obvious: 
Profiles that chart a lot are more successful than those that do not; competition 
winners gain recognition, and those profiles that never attract anything other than 
negative reviews for their songs or, perhaps worse still, no reviews at all are unlikely 
to gain recognition in the community. As with any community, however, the mores 
and norms are nuanced and, as in the case for the fast-changing technology of the 
Internet, somewhat in flux. 

Ultimately the reputation of users at ACIDplanet depends upon the sounds 
they produce and how others judge these sounds. Interestingly, although sound is 
at the core of the site — after all, it is a music site — most of the interaction users 
experience is via written and symbolic traces such as profiles, reviews, charts, 
and the framing of contests and their outcomes, not to mention the numerous 
forums. The translation, or what Stefan Helmreich and others in this volume cali 
the “transduction” from sound to another médium, is at the core of the reputation 
system. In short, the importance of the new technology is that it allows music and 
sound to be not only more easily produced but also turned into a symbolic form 
that a community of users can appraise and rank. The point we wish to stress is 
that the website, ancillary technology, and users together form what we cali a 
sonic sociotechnical community. 

Sound, as always, thus gets mediated by technology and community. What is 
ultimately unique about this community is that producers are consumers and vice 
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versa, and, as we have seen, this directly affects the norms, identities, and reputa- 
tions by means of which the community operates. Online music of this sort is 
capable of not only radically reconfiguring the ways musicians can form identities 
and reputations but also building upon and reinforcing rather conventional forms of 
musical identity and ways of allocating status (such as charts and contests). 

In this chapter we have shown not only the new possibilities that this new form 
of digital music making offers but also some of the problems and contradictions 
that arise. Although new musical identities are possible, oíd gendered identities are 
still very much manifest. Although new systems of building reputation emerge, they 
are based on the “charts” and “star” potential of older systems. The worlds of online 
and digital music are still in flux as the recording industry responds to a Computer 
company, Apple, which is becoming the major distributor of music via its online 
store, iTunes. 

Contradictions abound as to the future of online music production and con- 
sumption in general. Some see it as the way of the future, and others as a blind alley 
that will lead to little change and domination by a few major labels allied to the new 
distributors of online music such as iTunes. When diffracted through our users’ 
stories and their clear attachment to ACIDplanet and the user-generated commu- 
nity around it, the brave new world of online music appears perhaps a little less 
brave. 


NOTES 


1 For accounts of online music in general see Beer (2005) and Baym (2007). 

2 Interview with Dave Hollinden, Aug. 15, 2005. 

3 It is possible for a group of musicians to post their music from one profile, but there 
must be one registered user who “owns” the profile. 

4 For discussion of the complicated issues surrounding online musical copyright see 
Howard-Spink (2005), McCleod (2005), and Gillespie (2007). 

5 As determined by Daily Reach; www.alexa.com (accessed Oct. 19, 2006). 

6 Interview with Dave Hollinden, Aug. 15, 2005. 

7 Interview with PhillyC, July 7, 2005. 

8 Interview with PianoPlayer, July 22, 2003. 

9 Of course, it is likely that Sony has compiled more accurate information as the 
company has access to web logs and individual IP addresses. 

10 Interview with Johnnybreakbeat, July 16, 2005. 

11 Katherine Athanasiades wrote her undergraduate thesis at Swarthmore College on 
women Czech rock musicians ( Rockerka and Gender: Exploring Women in Rock in the 
Czech Republic, May 2005). Of the three women users we interviewed, one said she agreed 
to be interviewed only because the interview was to be conducted by a woman researcher. 

12 Interview with Hank Mohaski, Aug. 19, 2005. 

13 Interview with Sonic Epiphany, July 14, 2005. 

14 Interview with Chester™, Aug. 8, 2005. 

15 As of 2010 these songs no longer appeared in the all-time top fiíty. 

16 The use of the term whore further reflects the gendering of the site. 
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17 As of 2009 reviews can be posted anonymously. This change has been made almost 
certainly to try to prevent R = R from occurring. More recently, star ratings are no longer 
visible in each review, though what can be seen is an “artist score” and a “song score” 

on a scale of one to one hundred. Additionally, users can post a simple or an advanced 
review (the advanced review has sepárate components to comment on and rate, such as 
“production valúes”) in order to try to fulfill the requests of many users to have a more 
meaningful reviewing system. 

18 There was even a song posted at the ACIDplanet website titled “R = R.” 

19 Interview with mtkzoa, June 27, 2005. 

20 For a detailed analysis of this sort of talk see Tom Porcello, “Speaking of Sound: 
Language and the Professionalization of Recording Engineers,” Social Studies of Science 
34 (2004): 733 - 58 . 

21 One suspects many reviews on ACIDplanet are copied. In our study of reviews at 
Amazon.com we found about 1 percent of copied material (David and Pinch 2006). 

22 Interview with Hank Mohaski, Aug. 19, 2005. 

23 This possibility of having more than one musical identity has been practiced before 
in the offline world, where, for instance, bands such as Devo and XTC recorded under 
different ñames. ACIDplanet, by making this option democratically available to all, shows 
again the tensions between change and continuity at the website. 

24 The users’ role in the development of technology has recently gained much 
attention (e.g., Oudshoorn and Pinch 2003). 

25 Interview with Julián Bain, Aug. 18, 2005. 

26 Karin Knorr Cetina (e.g., Knorr Cetina and Grimpe 2008), in her work on financial 
markets, suggests that Computer screens and other sociotechnical devices that bring the 
whole market to the trader can be thought of as “scopic” Systems. The sorts of devices we 
have in mind do the opposite because, rather than bringing the whole market (actually a 
representation of it) to a user, they bring a particular piece of content to a large number 
of users. Most voting technologies popular on the Internet for rank ordering items are of 
this type. 

27 Contests used to be judged by the artists themselves or “their people” at the record 
label. However, under pressure from users who wanted more say in determining the 
outcome of these competitions, in 2007 ACIDplanet introduced a new form of contest 
called “the mosh” (now renamed the “duel”), in which users can vote for the winning 
entry and in the process also be rewarded for their voting behavior (see Pinch and 
Athanasiades 2009). 
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MO VING 
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STORAGE 



CHAPTER 21 


ANALOG TURNS 
DIGITAL: HIP-HOP, 
TECHNOLOGY, AND 
THE MAINTENANCE 

OF RACIAL 
AUTHENTICITY 


RAYVON FOUCHÉ 


The history of the black diaspora i s replete with examples of the ways music has 
enabled various black cultural communities to cope with racial oppression, carve 
out livable physical and psychological niches within an adversarially racialized 
world, and create supportive environments for Creative expression (Floyd Jr. 1995). 
From the spirituals sung by slaves during work in the nineteenth century to the 
protest songs that motivated black freedom movements around the world in 
the mid-twentieth century, music has been a constitutive part of the historical 
fabric that defined black social, political, and cultural agency (Ward 1998). With the 
global unraveling of de jure segregation, the direct link between music and black 
empowerment has waned. By the early 1970S in the United States, black music was 
beginning to lose some of its visible and audible political urgency. Creative energies 
were redirected to foment a new movement supporting urban cultural expression. 
This movement would eventually be known as hip-hop (George 1998). Though 
hip-hop is not singularly black or only a genre of music, some of the most domi- 
nant representations of hip-hop center upon two sets of artifacts: turntables and 
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vinyl records. These objects have been positioned to represent the nostalgic pasts, 
the vibrant presents, and the emerging futures of hip-hop. 

For most black social, political, and cultural movements, words and iconic 
figures have been the dominant representations of black life (Green 2005). 
Empowering and polarizing figures like Frederick Douglass, Booker T. Washington, 
Frantz Fanón, Martin Luther King Jr., and most recently Barack Obama have orated 
black consciousness. For hip-hop, the lyrics, performances, and significant figures 
are critical to the historical construction of hip-hop. Flowever, turntables and 
vinyl records demand similar historical and cultural recognition that in certain 
instances trump the power of a human actor. A head-bobbing and headphone-clad 
DJ bent over a set of turntables while spinning, mixing, and scratching has become 
an iconic representation of hip-hop (Goldberg 2004). Yet, within the last decade 
a new type of DJ has challenged this image: the digital DJ. These DJs are still seen 
grooving with headphones perched on their head, but they are now hunched over a 
new set of musical tools that digitally process inputs, signáis, and sounds. The digi- 
tal revolution carne late to hip-hop turntables but had profound implications for 
the culture nonetheless. On the surface it would seem that this technological evolu- 
tion would be welcomed, but by some it was not. For those wedded to analog turn- 
tables, needles, and vinyl records, these new devices were undermining a valorized 
historical representation of hip-hop. When forward-thinking or opportunistic 
companies (depending on your subject position) designed new technologies to take 
advan tage of digital music, fissures within this community of hip-hop aficionados 
emerged. The crisis was not only about the technology,production, and manipula- 
tion of sound but also about one’s access to, membership in, and identification with 
hip-hop. For some of the most ardent hip-hop heads, these new technologies could 
change the community forever. Yet in our current technological age, such change 
should not be seen only as an either/or option. 

For hip-hop this tensión, which had social, cultural, acoustic, and racial 
underpinnings, was resolved through a new technology aptly named a “digital vinyl 
system,” which allowed participants to stay true to their versión of the roots of 
hip-hop while simultaneously embracing the sonic and technical opportunities 
available with digital instruments. This controversy over technological choice and 
use that defines a community was resolved by embracing both sides of the contested 
space through a new technology created by an interested third party. Thus, a new 
technology provided a solution to a technocultural controversy. One of the first 
globally available technologies to fill this space was a hardware/software digital vinyl 
system named FinalScratch, which enabled DJs to map digital media onto a 
specially encoded vinyl record using an analog turntable. Surprisingly, this device 
was not created by hip-hop fans or turntablists but by a small Dutch software 
company, N2IT. The development, reception, and integration of digital vinyl Sys- 
tems illustrates how technological tensión between conceptions of hip-hop culture, 
identity, and authenticity can be resolved by a mediating technology. 

Within the large body of scholarship on black cultural expression and commu- 
nity formation, technology and the ways that black people all over the world have 
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used, created, and redefined technology are somewhat overlooked. Arguably, the 
role of technology in these spaces is most often viewed as either an adversarial or a 
benign affordance that human actors overeóme or instrumentally use (Eglash et al. 
2004). This chapter explains how sound-producing technology, in the form of vinyl 
records and turntables, functions within communities that endow these devices 
with cultural valué. Hip-hop is used to center the discussion on the ways in which 
turntables and vinyl records are ascribed a racial authenticity not seen in other 
music communities where DJs exist. The racialized and nostalgic valué of these 
artifaets for those inside, outside, and on the fringes of hip-hop is linked to familiar 
narratives of the destruction of black cultures, histories, and pasts that can be 
symbolized by events like the transatlantic slave trade or the múltiple appropria- 
tions of black music and culture (Gilroy 1993). In this historical context, what 
marks hip-hop as unique is that it is a product of a cultural moment in which black 
people have had the power and authority to construct and defend hip-hop as a 
cultural community (Dyson 1993). 

I begin with the premise that the amorphous object of inquiry known as 
“hip-hop culture,” similar to other music cultures, is a deeply technological way of 
life (Braun 2002; Chanan 1995; Frederickson 1989; Katz 2004; Lysloff 2003; Manuel 
1993; Miller 2004; Morton 2000; Pinch and Troceo 2002; Taylor 2001; Théberge 
1997). As hip-hop has evolved from a style, form, and sound of music associated 
with a multiracial community of artists, musicians, and partygoers in the boroughs 
of New York City to a global movement that melds, bends, and intertwines dispa- 
rate aesthetic, cultural, social, and sonic knowledge and experiences, a few elements 
have remained relatively constant. Some of the most constant have been technology 
and sounds or, more specifically, the artifaets and the beats. What is interesting 
about this consistency is how little the technology changed over the first several 
decades of hip-hop. Turntables and vinyl records, for hip-hop’s short life, have 
been the dominant technological tools of cultural expression supporting musical 
creativity and exchange. Without the tools and the desires for cultural expression, 
hip-hop would not exist. Jeff Chang (2005), Murray Forman and Mark Anthony 
Neal (2004), Nelson George (1998), Patrick Neate (2004), and Tricia Rose (2008) 
are a few of the many who have written eloquently about the local and global circu- 
lations of hip-hop. However, the literature on technology and hip-hop has not been 
as broad. In particular, the works of Joe Schloss (2004), Mark Katz (2004), Alexander 
Weheliye (2005), and Kodwo Eshun (1999), as well as facets of the Afrofuturist 
movement, have been instrumental in building these links. This work is excellent, 
but it has not fully explored the ways in which turntables and vinyl records acquired 
the cultural valúes and social power to render them critical elements in the story 
of hip-hop. 

Undoubtedly hip-hop is larger than turntables and vinyl records, but I do not 
address its limits here. Many scholars, fans, and artists have done that already 
and will continué to fight this never-ending battle (Bennett 1999; Condry 2006; 
Forman 2002; George 2001; Haskins 2000; Keyes 2002; Kitwana 2003; Maxwell 
2003; Mitchell 2001; Potter 1995; Rose 2008). Historically, hip-hop has been 
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composed of four elements: DJing, MCing (rapping), b-boying (break dancing), 
and graffiti writing (Austin 2001; Forman and Neal 2004; George 1998; Rahn 2002). 
There are múltiple overlapping forms of DJing: club DJs (who spin records for 
a dancing/party audience), production DJs (who create beats for a variety of perfor- 
mances), and turntablists (for whom the turntable is a performance instrument) 
(Bennett 1999; Kitwana 2003; Brewster and Broughton 2000; Poshardt 2000; 
Reighley 2000). As hip-hop has expanded and elements of it have waned, the DJ has 
been viewed as the keeper of its history (Souvignier 2003; Broughton 2003). With 
the responsibility of maintaining hip-hop comes the added pressure of remaining 
true and, most important, authentic to its roots. Not surprisingly, the term 
authenticity has engendered conflict. As a cultural historian of hip-hop, an authen- 
tic DJ is expected to possess the highest level of turntablist skill, a massive collection 
of vinyl records, and a voluminous mental archive of beats, breaks, and sounds. 
As hip-hop has become more global, the demands of this essentialized form of 
authenticity become harder and harder to uphold. What is interesting about 
the authenticity required of hip-hop is that it is technological in nature (Maces 
2008). 

In Faking It: The Quest for Authenticity in Popular Music , Barker and Taylor 
(2007) contend that authenticity by performers and audiences of all musical genres 
has been a never-ending quest in every córner of the art form. The problem has 
been that authenticity means something a bit different to everyone. Some artists 
and audiences are more interested in a representational authenticity that is 
displayed by “really” playing or singing while performing. Others are moved by 
personal authenticity, in which artists express themselves on the basis of their 
experiences. Performing from a location of cultural authenticity speaks to another 
constituency. Baker and Taylor argüe that the need for authenticity has inspired 
groundbreaking music, as when Jimmie Rodgers pioneered country music, but it 
has also done damage: “White blues fans . . . defined the genre in the ñame of 
authenticity to exelude anything too jazzy or upbeat, thus enforcing a snobbish and 
racist exclusión of certain blues artists from the cannon because they were too 
sophisticated.” This fantasized form of racialized music authenticity “lauded the 
most primitive blues artists they could find, such as John Lee Hooker, from whom 
blacks turned away. In this way, the quest for authenticity did tremendous damage 
to the blues by codiíying certain traditions and limiting innovation” (Baker and 
Taylor 2007, xi). What is dissimilar about hip-hop is that it was born in a different 
racial historical moment. In this moment black artists, fans, and consumers led the 
processes by which authenticity was defined, and it is a definition wrapped in 
turntables and vinyl records (Harrison 2009). Musicians and scholars have 
attempted to manage the issue of how to intégrate new technology for centuries. 
Simón Frith’s (1986) work on popular music and technology has thoughtfully 
destabilized the assumption that technology undermines or falsifies the authentic 
nature of music. Paul Théberge (1997) has eloquently explored the ways in which 
music and electronic tools have been coproduced. Similarly, Bernstein and Rockwell 
(2008) trace the development of the community of electronic artists, musicians, 
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and performers that for a short period of time called the San Francisco Tape Music 
Center home. In order to learn how hip-hop aficionados responded to new 
technologies that challenged what many understood as a taproot of hip-hop, this 
chapter explore the ways in which turntables and vinyl records as conduits of a 
nostalgic and racialized technological authenticity required hip-hop to intégrate 
new devices that did not undermine the community. 


Hip-Hop and Race in the Digital Age 


This is a story about the complexities of musical and technological evolution in the 
digital age (Goodwin 1988; Greene and Procello 2004). Questions concerning 
change and transformation have been central to the critical study of technology. In 
the past century thinkers like Lewis Mumford (1934), Siegfried Giedion (1948), Leo 
Marx (1964), Thomas Kuhn (1962), Langdon Winner (1978), Bruno Latour (1988), 
Wiebe Bijker and Trevor Pinch (1984) and many others have theorized about 
how and why different groups of people — whether a nation-state or cultural 
community — migrate from one technological era, period, or domain to another. If 
this is a story about technological change, how does hip-hop fit into these historical 
narratives? Generally, it does not. Examining race is vital to understanding hip-hop 
as a technologically inflected movement. The sound of hip-hop is not just about the 
music but equally about the sonic disruptions, stylistic reconfigurations, and 
aesthetic beauty that hip-hop released from and into urban and eventually subur- 
ban landscapes. Music has always emanated from a variety of nooks and crannies 
within cities, and the machinery that inhabit these urban environments has 
profoundly shaped black music (Dinerstein 2003). In the United States, this changed 
in the mid-i970s. The industrial cacophony of machinery still bathed the city with 
familiar sounds, but new technologies of listening allowed for many more public 
performances of music. The compact nature of urban dwellings has always pushed 
activity to the Street. Black music consumers led the way not only by filling the 
streets with black music but also by creating a new aesthetics of sounds connected 
to a visual arts movement (Smethurst 2005). Hip-hop is about the music and the 
cultural expression of a sense, a feeling, or a style and about performing these 
sensibilities with technological artifacts. Historically familiar images of Kangol hat 
and shell-toed, Adidas-clad performers are balanced by massive-speakered sound 
Systems and shoulder-mounted portable boom boxes that dotted the urban Street 
with black-informed sound, music, and technological use. This confluence of 
sound, music, technology and blackness, precipitated by acts of black vernacular 
technological creativity, brought the world hip-hop (Fouché 2006). Hip-hop and 
turntables allow one to glimpse the ways contemporary technology is interwoven 
into evolving perceptions of culture, identity, and authenticity, specifically within 
the context of sound. They also allow one to consider how technological identity 
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and the need for a racialized perception of authenticity functioned within the hybrid 
culture of hip-hop. 

The participants in hip-hop have been amazingly technologically innovative, 
but how have race and racial identity contributed to and inspired this work? From 
transforming the turntable into a musical instrument and developing scratching, 
practitioners of hip-hop have embraced technologies as a means to reshape music. 
Though Tricia Rose’s (1994) seminal work, Black Noise , focused primarily on rap 
music, some of the most valuable insights were about race, music, and technology. 
In arguing for a fuller understanding of the relationships between black music 
culture and technology, she concluded: 

Rap technicians employ digital technology as instruments, revising black musical 
styles and priorities through the manipulation of technology. In this process of 
techno-black cultural syncretism, technological instruments and black cultural 
priorities are revised and expanded. In a simultaneous exchange rap music has 
made its mark on advanced technology and technology has profoundly changed 
the sound of black music. (Rose 1994, 96) 

If Rose were writing the same section today, she would undoubtedly extend this 
prescient statement beyond rap music to the broader category of hip-hop. 

Beyond the music practices, artists have collaborated with engineers and tech- 
nologists to rethink and redesign the tools of the trade. Artists like Grandmaster 
Flash and Pete Rock and companies like Rane, Vestax, and Akai have participated in 
and promoted the collaborations in equipment design. Specifically, DJ legend 
Grandmaster Flash worked with Rane to design the innovative Empath mixer to be 
true to his hip-hop sound sensibilities. Flash has spoken about the contentious 
design negotiations that were necessary in order to have a mixer that satisfied his 
technological and cultural needs and wants (Fouché 2006). The short-lived maga- 
zine Scratch dedicated its pages to the beats and the technology of making those beats 
and wonderfully chronicled this symbiotic relationship. Scratch regularly ended 
interviews with a “toolbox” byline indicating technologies of choice. In an interview, 
hip-hop producer and DJ Just Blaze had an overflowing toolbox that included Akai 
MPC4000, Vestax QFO, Roland V-Synth, Roland JV-2080, Roland JV-3080, Roland 
XV-5080, Korg Tritón Rack, Minimoog Voyager, and Alesis Andrómeda (Sharp 
2005, 74). Just Blaze gladly used any and all tools that were available. The back 
sections of the magazines were divided into different “lab” spaces for readers to 
enter and learn about new techniques, artifacts, and practices. By highlighting the 
technological craft and skill of hip-hop, Scratch emphatically exclaimed that hip-hop 
is as much about the sound and the music as it is about the technology and the 
artifacts. More important, it confirmed its deep technological underpinnings. 

In most cases hip-hop has embraced new and emerging technologies of sound 
recording and reproduction and has smoothly made the transition from its analog 
origin to its digital present. Yet, the turntable and vinyl have been somewhat of a 
sticking point. The art of DJing is an indispensible skill that provides entrée into the 
constantly moving inner sanctum of hip-hop and is also a way to display a level of 
credibility and commitment to the art form. Even for technophiles, turntables and 




ANALOG TURNS DIGITAL 


511 


vinyl records represent an indispensable hip-hop past, without which the present 
and future could not exist. Just Blaze, one of these avowed technophiles, is also a 
vinyl aficionado. He uses his voluminous vinyl collection for inspiration and as a 
material display of his musical knowledge, his commitment to hip-hop, and his 
reverence for the history of hip-hop. Just Blaze explains: 

I’m a collector, so a lot of times Til buy records that I won’t even make a beat 
off of, but it will be like a classic breakbeat where there have only been a few 
thousand pressed . . . To find a classic hip-hop record from over the years, to 
find a mint 12-inch is like having a part of history, so Til go pay $200 for that. 

Til never use it in a beat. Til never spin it when Fm DJing because I wouldn’t 
want to mess up the record. (Sharp 2005, 72) 

Just Blaze’s respect for the history of hip-hop is expressed in his affection for 
vinyl. His comments also touch upon the meaning of consuming and possessing 
artifacts of hip-hop. For him, as well as others, it is important to possess a part of 
hip-hop history because he has taken on part of the collective responsibility 
of archiving the music. Currently the desire to experience or possess the artifacts 
of hip-hop is reflected in strong markets for certain vinyl records, Technics 1200 
turntables, and classic hip-hop mixtapes. 

Hip-hop has been as much about the distribution of sound as it has about the 
creation and manipulation of sound. Throughout the history of hip-hop, artists 
have managed the reproduction and exchange of their music through the mixtape 
(Bell 2007). The term mixtape refers to any form of recorded music/sound on a 
distributable médium. The development and commercial release of the compact 
audiocassette by Philips Electronics in the early 1960S was a technological innova- 
tion that hip-hop eventually exploited. Though the médium has migrated from 
inscribed music/sound onto magnetic tape within a plástic enclosure to encoded 
data on compact dises (and more recently the is and os of Online digital files), the 
importance of the mixtape has not changed. Historically mixtapes allowed hip-hop 
artists to sell their spontaneous Creative work to interested fans and enabled fans to 
make the experience transportable and replayable. More recently, the mixtape has 
become a promotional tool of artists and the music industry; however, for hip-hop, 
mixtapes have always been an important way to extend the party by conveying the 
experience of a live performance, presenting new artistic creativity, and spreading 
the sound of hip-hop. Thus, mixtapes, by distributing hip-hop rather than creating 
and manipulating sound, did not challenge the roots of hip-hop. This distinction 
marks an important difference in the ways that hip-hop has embraced technology. 
Those technologies that were perceived to augment or extend hip-hop’s reach 
without undermining it were readily accepted. 

Most of the technological incursions that spread within hip-hop, as with 
mixtapes, were acceptable, but something happened when the technologies changed 
from analog to digital. The mixtape, for all intents and purposes, was perceived as 
an analog technology in that the music was inscribed and visually recognizable on 
magnetic tape. If one looks closely, one can discern the topographic contours of the 
music recorded on an analog technology like a cassette tape or a vinyl record. 
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However, once one enters the digital realms, the music/sound loses its visible and 
tactile qualities. The sonic information is now housed in a series of black boxes 
represented by CDs, computers drives, and lines of code that viscerally disconnect 
the creator and the user from a level of musical materiality. The transition from 
analog to digital reframed the understood materiality of sound, music, and hip-hop 
itself. The reframing challenged prior assumptions of what hip-hop was. In this 
instance, the change from analog to digital precipitated a small but meaningful rift 
within certain subcommunities of hip-hop. These fissures produced two loosely 
defined camps that fought over the cultural past, present, and future of hip-hop. 
The factions constructed boundaries of authenticity around the appropriate 
or inappropriate use of technological artifacts. Historical argumentation that 
contended that analog was true to the roots of hip-hop clashed with futurist visions 
of the next step in hip-hop. What it meant to be a member of the hip-hop commu- 
nity could hinge upon one’s position on analog and digital technologies in relation 
to the use of turntables and vinyl records. Of course, there was much slippage 
between these two communities. Many hip-hop artists use the latest and greatest 
production tools and equipment, but the place of the analog turntable and vinyl 
records is central to a favored, constructed history of hip-hop. 

A racial subtext also exists. Black history outside of the African continent is 
replete with examples of exploitation through the application of technology. From 
shackles and ships that extracted African slaves from their homeland to the 
processes by which their bodies were reduced to commodities, black people have 
had an adversarial relationship with technology (Walton 1999). Hip-hop has 
endowed a specific black community with the power to define and create an empow- 
ering past. Interwoven into this past are vinyl records and turntables. It is in this 
context that digital technologies, which have the ability to overshadow or even erase 
this past, have caused many to hesitate. The tensión between analog and digital, 
or oíd and new, has been a popular trope within twentieth-century studies of tech- 
nology (Edgerton 2010). In the case of turntables, one can see a spinning record, the 
Stylus tracking a groove, and the spatial orientation of how much music has been 
played. The turntable is seen and heard in a familiar machine-age way, whereas 
digital technology is not seen or heard in the same way. The sonic feedback from a 
digital turntable comes primarily from the quietly rotating motor, which is 
perceived as analog but in actuality is at the limits of analog. The hidden láser that 
extracts information from a CD makes very little appreciable noise. The ocular 
assault of a digital turntable is a powerful juxtaposition of the bland visuals of an 
analog turntable. The digital turntable is marked by its silence. One can conceive of 
the stream of is and os passing through the virtually silent circuits, but the process 
by which these bits and bytes are converted into music is a mystery to most. It is the 
lack of mechanical age feedback that separates analog from digital in the minds of 
many (Mindell 2002). 

The power of this dominant historical narrative, in concert with a racial 
subtext, required most hip-hop followers to take a position, tacitly or explicitly, 
on analog and digital. This tensión raises interesting questions about the recent 
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past and how transitions from analog to digital can disrupt the way in which a com- 
munity defines itself. In specific, it can be disruptive if this community defines itself 
through the use of certain tools. When these tools change, does the community 
change as well? This is a familiar question within Science and technology studies, 
but it raises questions about how to understand the history of hip-hop through the 
lens of technology. This technocultural rift reveáis the tensión within hip-hop 
between technology and sound. If hip-hop is about the technology, then it matters 
deeply which technologies are used to produce the music. If it is about the sound, 
then it does not matter which tools are deployed to create hip-hop. Thus, this 
technological evolution affects the intertwined perception of the impact of technol- 
ogy on the sound of hip-hop. Many artists, producers, and hip-hop fans did not 
follow or participate in these debates. If pushed to the extreme, they may have 
produced a paradigmatic shift or technological closure that would have left analog 
technology (and those who use it) in the “loser” categories of outmoded technolo- 
gies or discarded artifacts. However, together with music consumers, organizations 
and individuáis invested in the survival of hip-hop as an evolving culture of 
technology developed technologies that enabled a balance between oíd and new, 
past and future. 

The language of the digital revolution speaks to a disappearing analog past. 
Digital rhetoric is put into direct opposition to the analog. Supporters of a digital 
future contend that a reliance and connection to the analog past is an anchor that 
preven ts one from embracing the unbridled poten tial of the digital age. Some would 
even argüe that we have “crossed over,” as did a 2002 Wired Magazine infographic 
that illustrated that, from that year on, most consumers would purchase cameras, 
music, and televisions in a digital rather than an analog format (“The Great 
Crossover” 2002). The mounting pressure to cross over obscures the actual 
experience of existing on both sides of this illusory divide. If we think about this 
analog/digital divide as also a machine age/computer age, visible/veiled, sound/ 
silence divide, it becomes apparent that most technological objects we encounter 
incorpórate aspects of each polarity. However, the proliferation of the digital has 
overshadowed this liminal space between the two realms. 

Researchers like Henry Jenkins, who cali our current moment a “convergence 
culture,” have put forth some of the most popular language about the digital age. 
In his words convergence culture “represents a cultural shift as consumers are 
encouraged to seek out new information and make connections among dispersed 
media contení” (Jenkins 2006, 3). Jenkins and others are interested in the conver- 
gence of digital media and information, both now and in the future. As catchy as 
convergence is, it is important to take a step back and study the moments before we 
merge into digital convergence to understand how digital technologies are not just 
replacing analog technologies or converging but are also very accurately simulating 
analog devices. We have not reached a point of convergent homeostasis. What does 
it currently mean that, even though we have a plethora of digital technologies, older, 
nostalgia-laden analog technologies from handmade shoes to classic automobiles 
resonate deeply with many people around the world? The perception is that 
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they are more human or humane because they lack some robotlike efficiencies 
(Georges 2003). 

It is important to understand the extremes of analog and digital, as well as the 
synergistic space in between, if we are to grasp the contemporary relationships 
between technology and hip-hop. For hip-hop, technologies that exploit the 
synergies between analog and digital have the potentially preserved collective and 
individual identities while maintaining user tactile and mechanical control. One 
current example of this synergy can be seen in software music-production tools. 
Propellerhead’s Reason , a commercially available, software-based ensemble of 
music-production tools, has waded into this space with its graphical interface. When 
working with the software, the user first sees the front of a graphical interface, which 
resembles familiar production equipment with the requisite collection of knobs, 
buttons, and switches and the familiar software dropdown menus and folders. 
However, it is the “back” of the interface that is most interesting. The rack system 
rotates virtually to display the back of the system with interconnecting cables. A user 
can click and drag to manipúlate the cables to simúlate cable patching that would 
be necessary to use the physical studio hardware. Why would a company model the 
back end for a piece of software? There is no need to patch cables or even look at 
the rear of the device in a virtual space. Propellerhead’s promotional material 
provides a rationale for simulating the analog experience of moving cables: 

Since Reason is made to look and work just like hardware equipment, we thought 
that the best and easiest way to handle the audio routing and such would be to 
do it like you do it in the studio: Flip the rack around! By hitting the [tab] key, 
the back of the rack is displayed and here all the cables and connector jacks are 
visible. . . . This form of patching means a fantastic freedom for the producer 
since you can create the most crazy noises by combining machines in the rack. 1 

This synergy between the analog practice of patching cables and the digital activity 
of clicking nicely embraces both forms of actions. Arguably the success of Reason 
is directly connected to this synergy. The designer and programmers chose to 
consciously embrace the synergy of both the analog and the digital to produce a 
piece of software that did not aliénate the potential users by eliminating familiar 
actions within a virtual space. Yet the case of Reason is distinctly different from that 
of hip-hop. The racial history of the technologies of hip-hop and what they mean 
for an origins story of hip-hop pro vides an additional layer. 


HlP-HOP AND THE EMERGENCE OF DIGITAL 

Vinyl Systems 


Supported by an explosión in music and engineering, the 1960S saw a new moment 
in relationships between sound and technology. In the 1950S the transistor radio 
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made music portable. In the 1960S, the compact audiocassette enabled portability 
and the added benefit of recording. The audiocassette profoundly restructured the 
processes by which everyday people listened, captured, and preserved sound. 
Although the audiocassette was important to the history of hip-hop, the develop- 
ment of new turntables in the late 1960S and early 1970S was even more influential. 
In 1965 the Matsushita Corporation, under its Panasonic división, launched the 
Technics brand to compete in the unfolding high-end, consumer hi-fi audio market 
in Japan. Loudspeakers were the first product it produced, and in 1969 Technics 
introduced a turntable, the SP-10. This was one of the first consumer turntables to 
eliminate belts and employ a motor to directly drive the platter on which a vinyl 
record rests. Technics named the next iteration of this turntable the SL-1100. Early 
hip-hop artists found this turntable to their liking due to its strong motor, durabil- 
ity, and fidelity — two things Technics turntables have been known for since then. 
In 1972 Technics released the SL-1200, which over the next few decades became the 
turntable of choice for múltiple musical styles. It became the most popular visual 
representation of hip-hop and has now reached cult status. Other turntables are 
more powerful and more flexible, but the 1200S have maintained their tradition and 
authenticity. Part of this history is connected to the early adoption of Technics 
turntables by some of the first hip-hop artists, like DJ Kool Herc during hip-hop’s 
early days (Chang 2005). The turntable’s partner is the vinyl record, and the 1970S 
vinyl phonographic recordings proved their ability to reproduce sound with a 
decent level of accuracy. For hip-hop DJs, this combination was unbeatable. The 
power to manipúlate sounds, beats, and musical expressions through the use of two 
turntables and a mixer in between, with a seemingly endless array of vinyl records, 
solidified the connection of hip-hop to these artifacts. 

The symbiotic relationship between hip-hop’s cultural output and the techno- 
logical artifacts significantly dictated the public representation of hip-hop. Anyone 
could enjoy the culture of hip-hop, but to be a real hip-hop head one not only had 
to know beats, breaks, and rhymes but also use Technics 1200S and vinyl records 
(Neate 2003). Some did stray from this tacit directive, but most supported this 
visión. Similar to the way material culture scholar Ian Woodward (2009) writes 
about the way in which individuáis and families nárrate their lives through the 
objects in their homes, the history of hip-hop is partially narrated through analog 
turntables and vinyl records. This has much to do with the consistency and stability 
of the technology. To stay relevant, hip-hop DJs had to possess the newest, latest, 
and hottest beats or mixes, which often required entering uncharted musical 
terrain. As the sonic compilations increased in complexity, the familiar became 
less familiar. However, as DJs, artists, and producers extended the boundaries of 
hip-hop, the technologies remained fundamentally the same and subsequently 
became foundational objects. As hip-hop narrativized itself and manufactured 
múltiple origins stories, some of the only things that could be viewed as original, 
authentic, and immutable were the turntable and vinyl records. 

By the late twentieth century, the compact disc had led to the demise of the 
audiocassette and relegated vinyl records to a minor commercial space inhabited by 
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audiophiles and collectors, of which some were DJs (Millard 1995). The illusion of 
the authentic analog hip-hop DJ weathering the digital storm, abetted by the narra- 
tive of hip-hop, empowered the DJ to keep the culture, craft, skill, and sound out of 
digital reach. This would all change in 2001, when Pioneer released the CDJ-1000. 
Until then, digital DJing had not been considered a viable possibility. Before the 
introduction of the CDJ-1000, rumors had been swirling about Pioneeres efforts, 
and it did not disappoint. This digital turntable allowed for the manipulation of 
CDs. With a jog wheel users could slow down, speed up, or scratch audio files 
similar to the way they handled vinyl. What set this device apart from prior digital 
controllers was that it was the first device to actually emulate vinyl and enable a user 
to manually control a CD within the device. Many smaller companies had tried to 
design and manufacture compact disc controllers that functioned similar to an 
analog turntable, but most did not succeed. They in part failed technically, but they 
also failed culturally by not fully understanding the users 5 needs and the resistance 
to technological change (Oudshoorn and Pinch 2003). However, Pioneer succeeded 
from a technical standpoint, which opened a cultural point of entry. This device 
changed the game. When the CDJ-1000 appeared, debates began to rage among DJs 
and hip-hop fans and in clubs but mostly on múltiple forums about the efficacy of 
the technology (e.g., its feel, latency, acoustic properties). The arguments regularly 
centered on authenticity and how to preserve it. 

A few African American DJs also felt that the digital turn was about erasing the 
roots of the culture and practice or, more important, its blackness. One New York 
DJ, DJ Soos, commented, “Who are all the vinyl DJs, the brothers? The digital DJs 
are rich white kids from the suburbs who can afford to spend $2,000 plus on that 
digital shit!” 2 As much as he did not want to talk about it, his comments ¿Ilústrate 
how the technological divide reinforced race and class difference. He was familiar 
with the digital-divide rhetoric of African American technological illiteracy and was 
concerned that this perception would be reproduced within black musical and cul- 
tural realms. This sentiment had everything to do with the history of hip-hop and 
how the DJ community identified with the artifacts its members used. Self-identified 
as a hardcore vinyl DJ, DJ Soos felt he was protecting the history, the culture, and 
his own identity from an outside technological, cultural, and racial onslaught. He 
was well versed in the unproven claim that young white men from the United States 
are the dominant consumers of hip-hop. He was equally knowledgeable about the 
Wholesale subjugation of African American people, and he saw the CDJ-1000 as 
merely an extensión of this tradition (Tate 2003). In this regard he commented, 
“Once again the white boy is coming to take our culture or, should I say, borrowing 
my blackness to make a buck? The reason they get into the digital shit is that they 
don’t have the skills for vinyl. They can borrow black culture but cannot feel it. 553 
Clearly what this DJ was getting at was that he felt that DJing was part of his culture, 
and he identified with the culture and the artifacts of this cultural community. His 
identity as a DJ was bound up in black technological aesthetics represented by vinyl 
records and turntables. Interspersed within his comments was a claim that 
white people could not culturally access hip-hop. He also contended that there was 
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something essential about hip-hop that can be understood only by black people. 
The response from a white DJ who appreciated vinyl but was primarily a digital DJ 
was “Psst. They just want to keep hip-hop black and for themselves. Hip-hop is for 
everybody!” Nonetheless, he acknowledged that a link exists between blackness, 
DJing, and hip-hop. He concluded: “All my heroes are black, and I do think of DJ 
and turntablism being a black-folks art form, but the game has changed. The 
technology has changed the game. You can’t deny a set of CDJs.” 4 These two 
commentaries ¡Ilústrate that the tensions between analog/digital and Technics 
12oos/CDJ-iooos had strong black/white racial overtures. In the context of the 
United States, it is nearly impossible to disentangle these issues and debates from 
this country’s troubling history of institutionalized racism. 

Race is relevant to black people around the globe; however, the politics of race 
vary greatly depending on local history, identity, and culture (Marable and Agard- 
Jones 2008). This American-centric black/white discussion is one of many that took 
place around the globe about the authenticity of analog and digital. However, what 
was at stake for everyone regardless of geographical location was the culture and 
practice of DJing and consequently hip-hop. The members of this ever-changing 
community agreed that the controversy was much less about the technological 
switch from analog to digital and more about the effect of this change on their col- 
lective identity and about who had control over the sound and means of hip-hop. 
If the technology on which the DJ community bases its collective identity changes, 
then the community changes, the culture changes, the historical underpinnings 
fade, and the community’ s collective identity might be destroyed. But no one 
wanted this to happen. No one wanted to undermine the art form and cultural 
practices of the community they loved. Even the most ardent vinyl DJ I interviewed 
would comment on how amazing the digital turntables were even though the 
DJ personally could not embrace them. The DJ’s identity and location within 
the larger hip-hop world were as a vinyl DJ, and that precluded embracing the 
nonanalog. 

As these events unfolded, small groups around the world were working on a 
technical solution to this problem. One avenue under development was custom-cut 
vinyl records. Smaller companies, like the now defunct Kingston, designed devices 
that straddled a turntable and received a music signal from an analog or a digital 
input source to drive a cutting blade to inscribe sound into a black vinyl disk. This 
process, if carefully done, allowed for a simpler means of producing individualized 
custom vinyl. The dream was to produce a readily available, low-cost, easy-to-use 
device for everyday DJs. The Japanese company Vestax was the only company 
to succeed in creating one of these devices with a price point in a consumeras 
reach (US$10,000). Announced in the summer of 2001, the Vestax VRX-2000 is a 
stunning and efficient machine, but by its own admission Vestax has not sold many 
in the last few years. 5 As enticing as this expensive device was for some, it was not a 
hip-hop community-wide solution to the analog/digital problem. The community 
was at a crossroads, with no apparent resolution in sight. However, two outside 
parties, one first in the Netherlands, and the next in New Zealand, designed and 
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invented Solutions that eventually were amenable to most DJs, turntablists, and the 
community and culture of hip-hop. 

On November 1 6, 1998, a small Dutch company, N2IT, distributed the 
following press release of their new device, FinalScratch (initially called Scratch): 

FinalScratch is the first Computer tool that lets DJs map digital media to spinning 
vinyl, the DJ’s musical instrument. The hardware/software combination gives DJs 
the ability to use turntables to synchronize and interact with various media types. 
FinalScratch’s architecture allows the user to synchronize múltiple tracks of audio 
and video and manipúlate them the way a DJ would. It lets users “needle drop” 
to set song position, change the RPM of the record for pitch and tempo 
manipulation, “scratch” by changing direction backwards and forwards, and 
synchronize each track’s tempo based on beats per minute. DJs no longer 
need múltiple hardware players to accomplish this task. Using FinalScratch, 

DJs no longer have to interact with digital media using a Computer mouse when 
mixing. (FinalScratch, 1998) 

On January 10-11, 1998, N2IT first exhibited this hardware/software combination 
at the BeOS Operating System Developer Conference. This might seem to be a 
strange place for such technology to debut but not when one understands that the 
developers of FinalScratch were programmers rather than DJs in any traditional 
sense. The lead developer, Marc-Jan Bastían, began working on FinalScratch as a 
technical problem presented at a hacker conference in the Netherlands in 1996. 
A discussion about digital music, vinyl records, and controllers resulted in a 
technical challenge, and Bastian led the way to find a solution. 

FinalScratch and subsequent digital vinyl systems allowed a user to control 
digital music on a laptop using specially encoded vinyl records and turntables as 
controllers. Proprietary software and an analog/digital signal converter facilitated 
the communication between the laptop and the turntables. On the screen a user 
saw a moving graphical representation of the music/sound encoded on the laptop. 
Users could manipúlate this data stream similar to the way they mixed, cut, and 
reassembled the information inscribed in vinyl records (Blum 2002). With a digital 
vinyl system, a DJ could create the illusion that no digital music was in use because 
it relied on familiar analog turntable techniques. Early promotional materials for 
FinalScratch indícate that the company saw its product as a bridge between an 
analog and digital divide. Specifically, one infographic shows the FinalScratch hard- 
ware device, or “hub,” centrally positioned on a bright red line. This line divides the 
image in half, with digital above and analog below. Above the red line and in the 
digital realm, a laptop is shown running the FinalScratch software. Below the red 
line and in the analog realm one sees two turntables and a mixer. Thus, the device, 
metaphorically and physically, enables DJs to maintain the historical, nostalgic, and 
authentic elements of the analog realm while simultaneously embracing the tech- 
nological innovations presented by the digital without undermining human con- 
trol over artistry and skill. 

Initially viewed skeptically by hip-hop DJs, techno/electronic DJs embraced the 
device. Clearly these different groups of DJs had only partially similar needs and 
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expectations. Techno/electronic DJs thrived with new technologies that enabled the 
Creative manipulation of sound. The two most central to its development were 
Richie Hawtin and John Acquaviva, who did not have the same racial and historie 
connection to the turntables as hip-hop DJs. For this community of electronic 
enthusiasts, it was a natural fit. Acquaviva would even say, tc We were techno DJs, 
but for all intents and purposes we were pretty conventional. FinalScratch made 
us practice what we preached” (Acquaviva 2002). The commercial versión of 
FinalScratch emerged in early 2002 with much enthusiasm but received mixed 
reviews. The main problem was that it was finicky, to say the least, with regular 
software crashes and connectivity problems of all types, as well as the need for 
significant amounts of processing power to make it run smoothly. The risk that it 
would fail during a performance was much too high for hip-hop DJs, but Acquaviva’s 
and Hawtin’s audiences were more forgiving with their technological experimenta- 
tion. The amount of online and offline chatter about this device was extensive, but 
the DJs that found themselves acting as translators between hip-hop and other 
musical genres often had the most interesting things to say. For instance, DJ Spooky 
wrote the following in response to a post about the troubling potential of the digital 
in 2001: 

The whole situation is what I’ve been talking about for a while. ... [T]his will 
bring digital music into a relationship with not only how we create sound but 
also how we think about file systems, media for data storage, and almost all other 
aspeets of how contemporary culture now is a culture of the “operating system” 
of networks of discourse. ... [I]t always makes me chuckle to see kids in Japan 
and Finland, and even like remóte Pakistán have a whole generation of budding 
cell phone composers and graphic designers. ... I cali it the Bin Laden effect — it 
makes people improvise their graphic design impulse in response to the social 
environment they find themselves enmeshed in, but of course, they also want 
sounds to accompany the process so they use cell phones to compose music 
for greetings and to select what kind of people they feel like hanging out with. . . . 

I tend to think that this will create some kind of compact multi-media platform 
that can handle almost all aspeets of digital creativity within a couple of years . . . 
think about how much the use of home computers and consumer audio 
electronics brought what was usually limited to academics and large corporations 
into the grasp of the average consumer . . . the kind of software that’s described 
in this article will pretty much do the same thing for digital sound production 
and all that that implies. (Miller 2001) 

Although DJ Spooky was clearly excited about its potential, many others were luke- 
warm. FinalScratch stumbled along, but soon hip-hop DJs like Jazzy Jeff began 
looking more favorably upon the technology in the months after the September 11, 
2001, attack on the World Trade Center in the United States and the explosión 
of digital file sharing. Traveling with crates of records was bulky but doable. 
However, after 9/11, Jazzy Jeff indicated that it had become extremely difficult and 
prohibitively expensive (Erenberg 2005). Extra baggage charges were cutting into 
his profits. With the rise of Napster and its file-sharing competitors, DJs could 
access vast troves of music quickly and, most important, easily. Digital file systems 
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allowed quick sorts through large numbers of records. Thus, DJs went from a few 
crates of carefully selected music to gigabit hard drives of digitally encoded music. 
A developing network of technical and social events began to make this sketchy 
technology look much better. Once DJs realized that they could still perform 
with turntables and vinyl and maintain their authenticity, it became all the more 
attractive. Early adopters were often apologetic about using the technologies and 
defensively stated that they used the devices only because it was the only way to 
make money while traveling internationally. Some were concerned about being 
construed as a digital DJ. FinalScratch eventually succumbed to technical problems, 
but, in 2004, collaboration between the highly respected Rane Corporation and 
Serato Audio Research, founded by Steve West and A. J. Bertenshaw, produced 
Scratch Live, which quickly gained wide acceptance among hip-hop DJs and became 
the market leader. 

This is not a case of winners or losers; the dynamics around analog turntables, 
vinyl records, and authenticity still persists. I contend that hip-hop is far from being 
in homeostasis but is tentatively lodged in an uncomfortable balance. In an 
interview about his use of Serato Scratch Live, DJ Jazzy Jeff, stated, “I still go out 
and buy records, but now I immediately record them digitally for Serato. I record 
direct from my DJ mixer and play the record straight. I don’t use any plug-ins or 
maximizers on my files before or after I record from vinyl because I want it 
to sound like a record.” Jeff finished this interview by repositioning himself as 
authentic by appealing to the art, skill, and craft of DJing: 

Im a purist — this [Serato] wasn’t supposed to work. I never in my life thought 
there would be a program like this. And I’ve been waiting for somebody in the 
crowd to get mad that Vm playing songs off my Computer. But more than 
anything, Eve gotten all the purists saying “Yo, that’s the most amazing shit in the 
world” ‘cause Em doing the exact same stuff. It does not change the integrity of 
DJing. It doesn’t make you DJ any better — the needle can still jump. If you are a 
bad DJ, with Serato, you still are a bad DJ. (Erenberg 2005, 96) 

Other DJs like Marley Mari welcome the technology and haven’t looked back. 
In response to the question about his affinity for technology he stated, “Technology 
today, you got to work with it because this enables you to do things quicker. Time 
is money . . . Come on, man, with technologies today I could do [beats] in my car 
on the way to the city. I could get on a flight to Japan and make two records and 
four beats before we land[ed]” (Creekmur 2006, 90). Nevertheless, there are still 
holdouts. One is Hank Shocklee, who is best known for producing the revolution- 
ary Public Enemy’s Fear ofa Black Planet. He laments the loss of the analog, and his 
thoughts about “feeling the music” resonate with the young African American DJ 
interviewed previously, DJ Soos. Shocklee argued as follows: 

Your enjoyment of hip-hop is lessened because you’re not listening to hip-hop 
in its true analog form. You’re listening to it in its digital form. Digital is a 
re-creation of hip-hop but it is not perfect. It’s like a photocopy of analog. 

Everybody says the golden era was the greatest moment in hip-hop. It was also 
the moment where everything was analog. You went from wax to drum machine 
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to tape ... As opposed to you going vinyl to drum machine to some sort of 
digital CD. That’s a whole different mechanism. This is what killed rock c n’ roll. 
Distortion does not register the same in the digital realm as it does in the analog 
realm. Because now distortion becomes a sound as opposed to having a physical 
presence. (Matthews 2006, 88) 

The tensión is still as much about the technology as it is about the sound. By 
focusing on the disagreements emerging as turntable signal processing moved from 
analog to digital, this chapter presents a case for the ways in which new technologies 
that were designed to medíate between analog and digital enabled hip-hop to repair 
the growing rift associated with the choice of tools one used. These technological 
synergies found in analog and digital — now known as digital vinyl systems — 
preserved collective and individual identities and maintained a user’s tactile and 
mechanical control while simultaneously embracing all that the digital had to offer. 
As a result of these new technologies the larger hip-hop DJ community was able to 
maintain itself and its collective and community identities. 

The intersections of hip-hop, technology, and sound help us to understand the 
ways the materiality of sound is embedded and circulated within society. In a 
most basic sense, as Trevor Pinch and Karin Bijsterveld suggest: “ ‘Follow the 
instruments’ in the same way that in the early days of S&TS we learned to Tollow 
the actors’ ”(Pinch and Bijsterveld 2004, 639). An examination of hip-hop 
technologies in use, as well as the cultural arenas in which individuáis, artifacts, 
and belief systems interact, links together the equally interesting but still under- 
developed work on race and technology. 


NOTES 


1 “The Reason Rack.” http://www.propellerheads.se/products/reason/index. 
cfm?article=devices_rack8dfuseaction=get_article (accessed August 1, 2009). 

2 DJ Soos, interview by author, Jan. 15, 2006. 

3 Ibid. 

4 DJ Light Flash, interview by author, Feb. 1, 2006. 

5 Hisao Kaneko, chief design engineer, Vestax Corporation, interview by author, 
May 31, 2007. 
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CHAPTER 22 


IPOD CULTURE: 
THE TOXIC 
PLEASURES OF 
AUDIOTOPIA 


MICHAEL BULL 


Introduction 


I have never cherished anything I bought as much as this little device. When I was 
a child, I used to watch a kids [sic] show called “the music machine” and I always 
dreamed of having something like that. A device that plays any song there is. The 
iPod comes pretty cióse to the fulfillment of this childhood fantasy. (iPod user) 

It has dramatically changed the way I listen to music. I use my iPod every day, 
generally for four to six hours a day. I listen to it at work, at home, in my car, on 
the subway etc. Whilst I frequently carried a personal CD player before, the iPod 
has become a necessity. When I leave the house, I now check my pockets for four 
things: My wallet, my keys, my mobile phone, and my iPod. I never go out 
without all four on my person. (iPod user) 

In July of 2007 the Guardian newspaper in the UK reported the case of a Muslim 
woman juror who had been discharged from a murder trial after she was caught 
listening to her iPod, concealed under her hijab, during important prosecution 
evidence. The judge had heard traces of a tinny music” throughout the trial but 
thought that it must have been either his imagination or a defect of his own hearing. 
The woman juror was subsequently charged with contempt of court. Perhaps the 
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female juror was picturing herself in a fictionalized courtroom, imagining the iden- 
tities of those around her, or perhaps she was merely lost in the soundtrack of her 
day — chosen from the múltiple playlists of her iPod. Given that the judge had heard 
her “tinny music” on several occasions it is unlikely that the juror had heard any of 
the court proceedings at all. The case reminds me of my own iPod research, in 
which a respondent who was a professor at an American university had stated that 
he had listened to his iPod while sitting on stage during his university’s graduation 
ceremony, passing the time in pleasurable auditory reverie. 1 

Both of these examples are very twenty-first-century examples of the privatized 
nature of much music listening — the iPod is a miniature device that fits discreetly 
into one’s pocket, and, once operational, it needs no further attention, unlike its 
predecessor, the Sony Walkman, where the user had to clumsily replace cassette 
tapes, CDs, or minidisks. The seamless and relatively invisible nature of use — the 
earpieces are designed to be discreet enables iPod users to create lengthy periods of 
potentially uninterrupted use during a wide variety of everyday activities. These 
examples involve risk — the woman juror was caught and prosecuted, and the 
professor would have been embarrassed and maybe disciplined if caught. They 
also demónstrate the pulí between the desire for individual auditory pleasure, 
which technologies such as the Apple iPod provide, and certain public prohibitions 
against that use. Both of the people mentioned are using the quintessentially mobile 
technology of the Apple iPod while sitting stationary in places they don’t particu- 
larly want to be in. These examples demónstrate that today any space whatsoever 
can potentially be transformed into a prívate auditory space of listening, thus 
empowering the users as they transcend the geography of the space inhabited, 
endowing it with their own auditory significance. It is precisely this ability to soni- 
cally transform increasing amounts of everyday experience that poses new ques- 
tions in relation to users’ underlying dispositions toward managing their daily 
experience. 

The study of iPod use can cast new light on users’ attitudes toward public 
places, others, and their own cognitive management of experience, thereby redefin- 
ing the power dynamic between the user and others and their own cognitive pro- 
cesses. MP3 technology has transformed music reception and users’ ability to pursue 
their pleasurable and privatized auditory interests. This capacity to sonically priva- 
tize space is possessed by the majority of those living in the industrialized world 
through the use of dedicated MP3 players like the Apple iPod or through mobile 
phones, which possess MP3 capability. Use occurs in any area of everyday life, from 
the domestic environment of the home, to the impersonality of the Street, to the 
working spaces of the office, to those engaged in sports, and to the exceptional 
spaces of the theaters of war in Iraq and Afghanistan (Pieslak 2009). 

The untrammeled pleasure of taking your own soundworld with you resonates 
through urban and cultural theory. It poses a set of theoretical problems relating to 
the nature of public and prívate existence and to how urban dwellers experience 
urban space. On the one hand, the continuous use of these technologies might be 
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interpreted as an act of liberation — giving users increased control over their 
environments and themselves (de Certeau 1988). Alternatively, as in the Marcusian 
analysis, use is interpreted as an act of colonizaron, whereby users become increas- 
ingly dependent upon the use of these technologies in order to satisfactorily survive 
and manage their daily routines. In the following pages I investígate iPod use 
through the lens of forms of toxic audiotopias. The epistemological starting point 
is the observation that the history of cides is in some sense a history of how we come 
to share social space. If this is the case, we are entitled to ask, What type of culture 
is the audiotopia of iPod use, and what does it tell us about the contemporary nature 
of mediated public and prívate urban space? 

The privatizing and colonizing impulse, which I associate with iPod use, has a 
long prehistory. We have throughout the twentieth and twenty-first centuries 
increasingly moved to music both through the provisión of Fordist technologies, 
such as those created by the now defunct Muzac Corporation, which created sonic 
environments of uniformity for consumers to move through, or by the Hyper- 
Post-Fordist technologies of MP3 players such as the Apple iPod, through 
which, with the aid of a pair of headphones, users create individualized and mobile 
soundscapes. 

The múltiple provisión and use of these technologies in the age of mechanical 
reproduction has produced an age of sonic saturation and colonization in which 
urban spaces — both prívate and public — are transformed, resulting in the 
continual redefinition of and contestation of the meanings attached to the way in 
which individuáis inhabit space and place. Early users of the gramophone and radio 
frequently sequestered social space collectively by listening on public beaches or 
during family picnics. From the 1950S on, the mass use of portable radios redefined 
the meaning of public space (Douglas 1999). More recently, ghetto blasters 
and automobile sound systems have privatized public space (Gilroy 2003). In its 
fluidity, sound has seeped into the spaces of everyday life like no other sense. 2 This 
“colonization” extends to the cognition and desire of users themselves. Importantly, 
this is frequently a “desired,” “active,” and pleasurable colonization, whereby users 
reclaim experience, time, and place (Bull 2007). It is precisely in this area of the 
intoxicating immersion of the sonic that the twin concepts of “audiotopia” and 
“toxicity” are situated in the following pages. The chapter focuses primarily upon 
the use of Apple iPods as the most common, dedicated MP3 player and because my 
primary research is located in their use. However, the following analysis is equally 
relevant for other MP3 players and users. 


Audiotopia, Toxicity, and Pleasure 


“Audiotopia” refers to both the intense pleasure described by iPod users as they 
listen to music on their iPods and to their desire for continuous, uninterrupted use. 
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Audiotopia comprises the desire for immersive auditory experience whereby the 
users’ chosen soundworld eradicates the preexisting soundworld that users inhabit. 
Francés Dyson comments upon the intímate relationship between sound technolo- 
gies and immersion: “Sound is the immersive médium par excellence. Three dimen- 
sional, interactive, and synesthetic, perceived in the here and now of an embodied 
space, sound returns to the listener the very same qualities that media mediates: 
that feeling of being here now, of experiencing oneself as engulfed, enveloped, 
enmeshed, in short, immersed in an environment. Sound surrounds” (Dyson 
2009, 4) (see also Grimshaw, this volume). The mediated re-creation of the users’ 
sense of the “here and now” is central to the following analysis of toxic audiotopia. 
Instrumental in creating this immersive State is the use of headphones, which 
transforms the users’ relationship to the environment and creates sonic privacy. 
The Science fiction writer William Gibson captures the power of sonic privatization 
in his description of using his Walkman, the iPods’ precursor: “The Sony Walkman 
has done more to change human perception than any virtual reality gadget. I can’t 
remember any technological experience since that was quite so wonderful as 
being able to take music and move it through landscapes and architecture” 
(Gibson 1993). 

Intrinsic to auditory privatization is headphone use. The habitual use of 
headphones began with radio reception in the 1920S. Siegfried Kracauer 
commented as early as 1924 on the transformative and seductive power of radio 
listening through headphones: 

[W]ho would want to resist the invitation of those dainty headphones? They 
gleam in living rooms and entwine themselves around heads all by themselves . . . 
Silent and lifeless, people sit side by side as if their souls were wandering about 
far away. But these souls are not wandering according to their own preference; 
they are badgered by the news hounds, and soon no one can tell anymore who 
is the hunter and who is the hunted. (Kracauer 1995, 333) 

Contemporary iPod users, in contrast to Kracauer’s radio listeners, claim to 
be masters of their sonic world as a result of the personalized choice of music 
embedded in their iPods. The impulse to construct a privatized auditory environ- 
ment is itself located in Western cultural valúes. Jonathan Sterne notes that: 

as a bourgeois form of listening, audile technique was rooted in a practice of 
individuation: listeners could own their own acoustic spaces through owning the 
material component of a technique of producing that auditory space — the 
“médium” that stands in for a whole set of framed practices. The space of the 
auditory field became a form of prívate property, a space for the individual to 
inhabit alone. (Sterne 2003, 160) 

Headphones reempower the ear against the contingency of sound in the world, 
bolstering the individualizing practices of sound reception. The transformative 
nature of using headphones is illustrated by the following iPod user: 

Although being alone listening to music through conventional speakers can make 
listening pleasurable, having pod headphones in can make that sensation 
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infinitely more pleasurable. The feeling of blocking out other sounds and by 
implication other sources of interference only heightens the pleasure. You may 
also be made aware of subtle sounds that you can only hear when wearing 
headphones. This “new disco very” enhances it still further. You may have heard a 
track dozens of times via speakers and never heard a particular piece of music 
before just because you were wearing earphones. (Hillary) 

Embodied in these practices is the desire to control the sonic environment. Many 
iPod users claim to prefer to live in a totally mediated, privatized auditory world 
with their music collection at their fingertips. They are not merely enjoying their 
own privatized auditory reveries but are also taking pleasure in being able to soni- 
cally control the very nature of their everyday life. It is here that the sonic 
pleasure of audiotopia resonates socially. Music reception for iPod users is no 
longer the straightforward “mutual tuning relationship” that Ernst Bloch com- 
mented upon. He distinguished music from the visual by arguing that “a note of 
music comes with us and is Ve’ unlike the visual, which is primarily an T divorced 
from the other” (Bloch 1986, 287). The relational aspect of iPod use differs inso- 
much as experience becomes primarily an audiovisual one in which States of medi- 
ated “we-ness” eclipse direct forms of experience by technologically mediated forms 
of experience (Adorno 1976; Dant 2008). The mutuality of music reception becomes 
increasingly asymmetrical in its privatizing, yet mediated nature because users are 
immersed in the chosen sounds of the culture industry contained in their iPod. 
Sounds fed directly into the users’ ears through headphones often placed directly 
into the ears — directly into the experiencing subject — act so as to reduce the out- 
side world to silence. In its immersive qualities, mediated sonic experience often 
appears to be more “immediate” to users of iPods than nonmediated experience . 3 

The mediated immediacy of iPod use may lead to a form of social toxicity. The 
term toxicity should, however, be handled with caution. Western consumer 
relationships to a whole host of consumer practices and technologies from shop- 
ping, gambling, and watching televisión to playing video games has frequently been 
described as compulsive — yet compulsive behavior need not be equated with any 
detrimental social attribute or harmful effect. The inability to stop reading a thriller 
novel, for example, because one is engrossed in it might be considered in positive 
terms. In the main, however, “toxicity” appears largely as a dystopian State whereby 
users’ engagement with pervasive new technologies potentially decreases their 
capacity to disconnect from their use of these technologies (Rheingold and 
Kluitenberg 2006, 29). Toxicity refers primarily to the negative moment embodied 
in mediated technological connectivity. This has a long cultural and theoretical 
history stretching from Heidegger to Marcuse. Marcuse commented as early as the 
1960S that “solitude, the very condition which sustained the individual against and 
beyond society, has become technically impossible” (Marcuse 1964, 68). For Marcuse 
this referred to both the social construction of mediated subjective desire and the 
nature of social spaces that were a consequence of such a disposition. The introjec- 
tion of a range of social dispositions to consume was to be understood through the 
notion of “objective alienation,” whereby users had fully normalized alienated forms 
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of experience. In this image of “toxicity,” consumers become victims of the market- 
ing strategies of companies and the design and form of their technologies. Implied 
in Rheingold and Kluitenberg’s understanding of “toxicity” is the users’ inability to 
experience forms of nonmediated experience they sometimes desire. Chatfield 
points to the desire for “toxic” immersion among video-game users: 

People are perilously drawn by temptation to withdraw from real life’s 
complexities into a solipsistic, simpler world. In the case of video games, in 
particular, it’s hard to deny this kind of escape isn’t a large part of their appeal. 

Escape, simplification and control play their part in all games and in electronic 
games has reached a remarkable pitch of sophistication.” (Chatfield 2010, 73) 

In the following pages I problematize, develop, and reevaluate the notion of 
“toxicity” in relation to the specifically auditory dynamic of iPod use. Before doing 
so I wish to clarify the position taken in the following pages. I do not claim that all 
iPod use is toxic in nature but merely that it is one, albeit common, possibility of 
use. Neither do I claim that users who embody potentially “toxic” modes of usage 
are fundamentally different from users who do not. Toxicity is merely a structural 
possibility of use that becomes socially significant with mass usage as described 
earlier. If “toxic pleasures” relate to a continuous withdrawal from the physical 
immediacy of experience and its re-creation engendered through the continuous 
creation of privatized sonic environments, then we require a nuanced analysis 
of both the pleasures and the consequences of such practices and a normative 
understanding of users’ rationale, which underpins such practices. 

Toxicity is divided into three categories in the following analysis: In the first 
instance toxicity refers to users’ inability to disconnect from use despite their poten- 
tial realization of the problematic nature of use. Central to this form of toxicity is 
the seductive nature of an empowering, anesthetizing potential embodied in iPod 
use. Second, it refers to modes of habitual use whereby users do not experience this 
“tethering” as at all negative but merely pleasurable. “Toxicity” in this case refers to 
the negative appraisal of the shared nature of social space, which is to be replaced 
with the audiotopia embodied in iPod use. Third, “toxicity” refers to the actual 
physical nature of damage that can occur through loud, continuous use of the iPod 
and need not require any cognitive recognition by the user of this possibility. 
If there is recognition of this potential, then toxicity might refer to “state one,” 
where users find it difficult to change their use. 


Toxic Pleasures 1: Seductive 

Audiotopia 


Digitalization has enabled iPod users to stream music for every conceivable 
situation through the creation of playlists, which can be created and changed while 
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on the move. Users possess unparalleled control over their daily sonic life both 
externally, via the aestheticization of their experience, and cognitively, through 
their ability to regúlate their moods and volition through the micromanagement of 
their music listening. Perpetual sonic connectivity becomes a seductive audiotopia 
in the daily lives of many iPod users, transforming their relationship to the environ- 
ment they inhabit and move through. Embodied in iPod use is a range of sonic 
rituals that structure users’ daily life, and as such the iPod might be understood as 
an urban “Sherpa,” enabling users to successfully maneuver their way through daily 
urban life. However, recognition of this pleasurable, hermetically sealed activity 
sometimes produces an ambivalent response: 

I walked around with a Sony Walkman attached to me whilst walking or roller- 
skating my way around many major cities in the world during the 1980S. In 
retrospect I wish I had not been so stuck in my own head, so disconnected from 
the natural symphony of place. My community members feel so alien to me now. 

They stand in line in front of me, dancing ever so slightly to their tune, often 
oblivious to what’s happening around them and completely closed off from the 
niceties of the neighbourly “helio.” (Alison, hppt://www.radioopensource/ 
the-age-of-shuffle/) 

Alison, an ex-Walkman user reminds us of both the historical nature of auditory 
privatization and the desire for a shared public space in the city. The writer Gabriel 
Sherman describes his own ubiquitous iPod use in the following terms: 

Almost anywhere I went, I plugged in and tuned out. Need cash from the ATM? 

The Shins’ melodic New Slang would accompany me. Picking up my laundry at 
the Wash and Fold? How about Rachael Yamagata’s sultry swooning? My music 
even joined me in the bathroom each morning before work. With more than 
1,000 songs at my thumb tip, I could satisíy any desire, any time. My iPod was 
like a drug. I live in my own self-imagined movie, instantly tailoring the sound- 
track to fit, or inspire, my emotions ... I even acquired the telltale signs of an 
addict. Just before leaving places, I fidgeted nervously while contemplating what 
song I would queue up. (Sherman) 

Sherman describes the ambiguous pleasures of living in a pleasurable audiotopia; 
his use highlights both the utopian and dystopian elements of iPod use. This 
dystopian moment of use I refer to as the toxic pleasures of use. “With my 
earphones in, I became deaf to the urban orchestra playing around me. Even worse, 
my iPod had sapped the energy that makes New York more exhilarating than the 
places we all escaped from. I had traded one kind of suburban isolation for another.” 
Sherman, like many other users of a range of mobile technologies from the mobile 
phone to the iPod, finds himself increasingly dependent upon these technologies to 
maintain his daily life, of being “tethered” to a wide range of communication 
technologies, of being available 24/7 (Ito, Okabe, and Matsuda 2005). Sherman 
struggles with the ambiguous nature of his iPod use, realizing that he is caught 
between the pleasures of a privatized aesthetization of the city and his compulsive 
use pattern. His description highlights the multifaceted pleasures associated with 
continual or habitual use. The transcendence of the “here and now” is embedded in 
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the seductive pleasures of sonic toxicity, and this transcendence is frequently aes- 
thetic in nature. 

The aesthetic colonization of urban space described by many iPod users is in 
part a technological tale whereby urban experience becomes synonymous with 
technological experience. This technological experience is both pervasive 
and increasingly taken for granted in wide areas of daily life. It is simultaneously 
empowering and dependent. 

IPod use seamlessly joins together disparate experience by unifying the 
complex, contradictory, and contingent nature of the world beyond the user. The 
success of these strategies depends upon the creation of an all-enveloping wall 
of sound through which the user looks. Users report that iPod experience is at its 
most satisíying when no external sound seeps into their world to distract them from 
their dominant and dominating visión. 

IPod users invariably aim to create a privatized sound world that is in harmony 
with their mood, orientation, and surroundings, enabling them to respatialize 
urban experience through a process of solipsistic aestheticization. IPod users aim to 
habitually create an aesthetically pleasing urban world for themselves as a constitu- 
ent part of their everyday life. In doing so, they create an illusion of omnipotence 
through mediated proximity and “connectedness” engendered by their iPod use. 

For many iPod users the Street is orchestrated to the sounds of their favorite 
playlists: 

The world looks friendlier, happier, and sunnier when I walk down the Street 
with my iPod on. It feels as if I’m in a movie at times. Like my life has a sound- 
track now. It also takes away some of the noise of the streets, so that everything 
around me becomes calmer somewhat. It detaches me from my environment, 
like Fm an invisible, floating observen (Berklee) 

Susan, a manager from Toronto, describes the iPod’s transformative power over 
her urban environment: 

I find when listening to some music choices I feel like I’m not really there. Like 
Fm watching everything around me happening in a movie. I start to feel the 
environment in the sense of the mood of the song and can find that I can start to 
love a Street that I usually hate or feel scared for no reason. (Susan) 

I see people like I do when I watch a movie . . . there is a soundtrack to my 
encounters . . . music to accompany my thoughts about others. It dramatizes 
things a bit. It filis the silent void. (lune) 

Streets perceived as silent are in reality a complex of sounds — June’s observation 
that her iPod filis the “silent void” is indicative of users’ experiencing the world 
solely as a function of mediated sound. Mundane, yet nevertheless unmanageable 
urban life is transformed through iPod use, which creates movement and energy in 
the user where there was none before. The use of the iPod provides a “buffer” 
between the user and the recognized reality of the city Street. 

Users often describe the world experienced through iPod use as a movie script 
in which they play the central role. The selection of “sad” music, for example, is 
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used to match the users’ mood, transposing those feelings to the streets passed 
through. The world and the users’ experience within it gain significance precisely 
through the creation of an enveloping and privatized soundworld. In the users’ 
world of aesthetic euphoria, experience is simplified, clarified — the aesthetic 
impulse provides an unambiguous sense of purpose and meaning for users, who are 
creating a “space” within which to unwind and unravel their emotions. When 
attended to, the Street becomes a function of their mood and imagination mediated 
through their iPod. 

IPods are both interactive and noninteractive in the sense that users construct 
fantasies and maintain feelings of security precisely by not interacting with others 
or their environment. Through the power of a privatized soundworld the world 
becomes intímate, known, and possessed. Imagination is mediated by the sounds of 
the iPod, which become an essential component in the users’ ability to imagine at 
all. Users are often unable to aestheticize experience without the existence of their 
own individual soundtrack acting as a spur to the imagination. 

In this ordering of cognition the user surpasses the disjunction that exists 
between their own soundtrack, the movement of others, and the environment they 
pass through. Without the iPod, they experience the world out of sync. The 
polyrhythmic nature of the city relativizes their own place within the world and 
makes them just one more piece of an anonymous urban world. 

Toxicity and aestheticization can be interpreted as having utopian implications 
for users — to aestheticize is to transcend the mundane world as it is experienced — 
aestheticization remains an active mode of appropriating the urban, transforming 
that which exists, and making it the user’s own (Marcuse 1964). In this instance 
iPod use reempowers the subject in relation to urban space. The French 
anthropologist Marc Augé has described cities in terms of a non-spaces”: spaces 
semiotically void of interest such as shopping centers and parking lots. Now, users 
can re-create meaning for themselves. Through the privatized experience of music, 
they subjectively endow all space with meaning. The use of Apple iPods permits 
urban theorists to increasingly understand the city as a privatized audiovisual 
creation of the user and to reprioritize the unique role that sound plays in the 
construction of daily life (Bull 2000, 2007). Now any site of use can be refashioned 
to mimic the auditory desire of the user — in essence all spaces are “nonspaces” 
to be refashioned by iPod users. 

In addition, iPod users transform the world into conformity with their 
predispositions — the world becomes part of a mimetic fantasy in which the “other- 
ness” of the world in its various guises is negated. This is an important strategy for 
iPod users who subjectivize space — consume it as if it were a commodity. In the 
process, immediate experience is fetishized. Technologized experience can be 
understood as fetishized experience — experience becomes real or hyper-real 
precisely through its technologization — through technological appropriation. 
The utopian impulse to transform the world occurs only in the imaginary — in its 
technologized instrumentality, the world remains untouched. The use of the iPod 
provides a buffer between the user and the recognized reality of the city Street, 
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invoking Kracauer’s observation that ÍC the worlcbs ugliness goes unnoticed” (1995). 
Users prefer to live in this technological space whereby experience is brought under 
control — aesthetically managed and embodied — while the contingent nature of 
urban space and the “other” is denied. 

Yet, in this denial of contingency lies a liberating moment understood as a form 
of reenchantment. I suggest that in this positive moment iPod practices can be 
understood as the coming together of a reenchantment of the city, understood 
through Walter Benjamin’s concept of “aura” and Michel de Certeau’s notion of 
Street “tactics.” IPod users are, in effect, individualizing each journey while simul- 
taneously making urban space their own. In their auditory empowerment they 
create their own, unrepeatable, individual journey. In doing so they create, contra 
Walter Benjamín, their own auratic experience of the city — nonreducible to any 
other as it is framed — indeed, made through the mediated sounds emanating from 
their iPod. 


Toxic Pleasures 2: The Spatial 
Secession of iPod Culture 


Just as Sherman becomes aware of the “secessionist” potential of the iPod, so for 
other users it is precisely this secessionism that is attractive. The negative moment 
of the auditory pleasures of privatized listening lies precisely in the fictionalization 
or negation of the “other,” whereby forms of urban reciprocity and of urban recog- 
nition are denied. In enacting these strategies, the empowerment of the subject 
within iPod use problematizes the very way in which users might “recognize” others 
(Honneth 1995). 

In the following pages I appropriate Henderson’s notion of “spatial secession” 
in order to view this “negative” moment of iPod use. Henderson, who was primar- 
ily concerned with the división of space through automobile use, argued that 
automobility was often embraced as a tool of “spatial secession” rather than as 
a pleasure in itself: “[MJobility is not just movement but also an extensión of ide- 
ologies and normative valúes about how the city should be configured and by 
whom” (Henderson 2006, 295). Secessionist automobility refers to the use of a car 
“as a means of physically separating oneself from spatial configurations like higher 
urban density, public space, or from the city altogether” (Henderson 2006, 294). 
Secessionist valúes equally play a central role in the world of the iPod user, from the 
Street to the automobile — users enact isolationist strategies with their headphones 
on or encased within the shell of the automobile, with the iPod docked into its 
sound System. 

IPod use embodies forms of urban retreat, which has become a dominant met- 
aphor in the dystopian image of urban life, whereby urban citizens attempt to 
maintain their sense of “self ’ through a range of distancing mechanisms from 



536 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


the “other. 55 The iPod is the latest technological addition to the urban citizerTs 
ability to neutralize urban space. 

Benjamín understood the city as reducing the subject’s capacity for thought 
due to the need for constant response to the contingency and plenitude of urban 
life. Urban “technology [had itself] subjected the human sensorium to a complex 
kind of training 55 (Benjamín 1973, 171). This training has enabled the subject to 
retreat from urban space by neutralizing it. Technologies of separation such as the 
iPod have progressively empowered urban citizens precisely by removing them 
from the physicality of urban relations. Urban experience undergoes a “ghettoiza- 
tion” through these secessionist strategies, furthering the technological interven- 
tion of subjects intent on urban retreat, transforming the polyrhythmic nature of 
urban space into the interiorized, monorhythmic sounds of users: “It 5 s as though 
I can part the seas like Moses. It gives me and what’s around me a literal rhythm, 
I feel literally in my own world, as an observer. It helps to regúlate my space so I can 
feel how I want to feel, without external causes changing that” (Susanna). 

IPod users invariably never willingly interact with others while engaged in 
solitary listening; interruption destroys the seamless reverie of use. It also indirectly 
relates to the silencing of experience in the city, which predates iPod use (Sennett 
1994). The iPod in this instance acts as both a “gating” and a “tethering 55 technol- 
ogy. Indeed, gating assumes a metaphysical stance in iPod use: The drawing of 
circumscribed circles around the subject is both physical and metaphorical. This is 
one of the historical legacies of the valúes of individualism and the concurrent 
demand for privacy — an overriding desire to be left alone while in urban culture. 
IPod use increasingly becomes a habitual “mode of being in the world” 
(Geurts 2002, 233), in which users choose to live in an increasingly privatized and 
“perpetual sound matrix” through which they “inhabit different sensory worlds 55 
(Howes 2004, 14) while sharing the same social space. 

The public placing of and consumer use of a range of technologies enables 
the urban citizen to carry out most traditionally public tasks with little or no 
interpersonal contact, which furthers the architecture of isolation; exchanges are 
increasingly taking place between subjects and machines in urban culture, making 
interpersonal exchange obsolete. Cognitively, consumers increasingly expect, feel 
comfortable with, and desire no communication while out in public: 

Tracy expects to have wordless interactions with store attendants. When given 
a choice, Tracy will also use the “You-Check” line recently available in her 
neighbourhood’s Fred Mayer, which allows her to sean her own purchases 
and credit card: “I love it,” she says. In everyday life, she wants to a get in and 
out.” (Jain 2002, 394) 

The normative foundation of the “nonplaces” of urban culture becomes etched 
into consumers 5 social expectations as they partake of “public culture. 55 

Physical and cognitive zoning becomes second nature in urban culture: “[T]he 
basic idea of zoning is that every activity demands a sepárate zone of its own 55 
(Kunster 1998, 120). Urban infrastructures increasingly complement a range of 
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technologies from the automobile to the iPod. These technologies emphasize con- 
nectivity within circumscribed circles and spaces while simultaneously alienating 
subjects from the copresence of one another in public space, as the following iPod 
users describe: 

I rarely even speak — I just hand them my credit card and say thank-you. (Mark) 

I tend not to notice people when Im plugged in. Im usually too 
preoccupied with myself to look at others. (Elizabeth.) 

It removes an external layer. I see people and things as inanimate or 
not-fully-connected. It seems that I have an external connection they lack. 

It’s quite odd, actually . . . Yes. With the iPod and news talk radio files, I am 
having an interactive session with the anchor. When I look at the people around 
me, they appear to be two-dimensional and without significance. (Jonathan) 

Progressive withdrawal from the cosmopolitan city both motivates iPod use and 
furthers it: 

I then started wearing it [the iPod] while shopping. I did it to control my 
environment and desensitize myself to everything around me. What I found 
interesting was that the more I wear my iPod, the less I want to interact with 
strangers. Pve gotten to the point where I don’t make eye contact. I feel almost 
encased in a bubble ... I view people more like choices when Im wearing my 
iPod. Instead of being forced to interact with them, I get to decide. It > s almost 
liberating to realize you don’t have to be polite or do anything. I get to move 
through time and space at my speed. (Zuni) 

To be separated from others is simultaneously a mark of distinction (Bourdieu 
1986) and a mark of alienation. City spaces become enacted spaces that are 
enjoyed and modified (de Certeau 1988; Lefebvre 1991) and also structural spaces 
that contextualize all behavior. 

IPod users frequently glide through the urban Street silently, and silence 
is imposed upon others as they passivize the looks and remarks of others. 
The silencing of the “other 55 is a strategy of control that represents a refusal to 
communicate with others in public. The users 5 privatized sonic landscape permits 
them to control the terms and condition of whatever interaction might take place, 
producing a web of asymmetrical urban relations in which users are invariably in 
control: 

A person with headphones on gives off an appearance of not wanting to be 
disturbed. There are times, mostly at work or walking to and from work, when I 
just want to be left alone. Wearing the iPod insulates me from other people in my 
surroundings. (Amy) 

Users are aware of the symbolic meaning of the white wires of the iPod dangling 
from their ears: a combination of distinction and power. Empowerment is a prod- 
uct of withdrawal, and withdrawal creates an empowering sense of anonymity: 
“I use my iPod in public as a ‘privacy bubble 5 against other people. It allows me to 
stay in my own head 55 (John). 

IPod use permits users to redraw or redefine “personal space. 55 Goffman con- 
sidered personal spaces to be “the space surrounding an individual anywhere within 
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which an entering causes the individual to feel encroached upon” (Goffman 
1969, 54). Goffman’s own definition represented an unrecognized historical moment 
of bourgeois urban sensibility — the entitlement to personal space. IPod use 
enhances this sensibility toward the ownership of space whereby the encroachment 
of others loses its physicality as users fail to notice or respond to the physical touch 
of others (Pinch 2010). 

Forced interruption is invariably experienced negatively: “Sometimes I feel 
violated if I have to turn it off for an unplanned reason.” The breaking of the 
auditory bubble represents recognition of the fragility or contingency of auditory 
empowerment. An involuntary and sudden return to the world, as others experi- 
ence it, is invariably experienced as unpleasant. Maintenance of control frequently 
implies a denial of difference: 

In America, people are often loud and rude and it’s sometimes hard to 
concéntrate effectively. In Phoenix, we have a lot of Mexican immigrants. They 
don’t learn English and they have no control over their children. I believe in 
mutual respect when in public places. It was becoming increasingly difficult for 
me to shop without encountering a bombardment of Spanish or screaming kids. 

The iPod lets me filter them all out. Im much calmer now when I shop. The 
iPod lets me overlook the lack of courtesy. Using the iPod helps control my 
concentration. Since Im familiar with the music, I can let it float to the back 
of my consciousness. (Tracy) 

Tracy achieves a State of equilibrium precisely by withdrawing into herself. 
Sennett has described this form of behavior as representing “an early sign of the 
duality of modern culture: flight from others for the sake of self-mastery” (Sennett 
1990, 44). The secessionist practices of iPod users embedded in this second form 
of toxic audiotopia reflects a negative moment of urban experience in which the 
urban subject uses a range of communication technologies to partition and remake 
experience. In doing so the very nature of collective urban space is thrown into 
question. 


Toxic Audiotopia 3: Damaging Silence? 


Toxic audiotopia also has potential physical consequences for users. The primary 
aim of many, as we have seen in this chapter, is to immerse themselves in the sonic 
environment created by the iPod. To achieve this, the sounds of the outside world 
are canceled out. This results in high volumes of listening: 

I usually turn it on as loud as it will go or until it sounds crappy. I don’t like 
background noise when I am listening to my iPod. (Mary) 

Always loud — I can never hear anything around me. (Andy) 

The seductive nature of the iPod — the proximity of loud music is also potentially 
toxic to the users 5 own sense of hearing. This third form of toxicity deais with the 
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actual physical damage to users’ hearing that results from loud and continuous 
usage. In addition to listening at loud volumes, users increase their music listening 
on average twofold with the purchase of an iPod. Music also does not appear to 
suffer from the routinization effect of other mobile technologies such as the mobile 
phone. After two years in follow-up interviews, respondents claimed to have 
maintained their level of use over the previous two years. 

A recent study of sonic impairment published in the United States found that 
1 6 percent of American adults have some degree of hearing loss. It was estimated 
that iPod users who turn up the volume to about 90 percent for on average two 
hours a day, five days a week will develop significant hearing loss. One author of the 
report stated that ct one patient I had used his headphones instead of earplugs when 
he was on his construction job. He thought as long as he could hear his music over 
the sound of his saws, he was protecting his ears — because he liked the sound of his 
music but didn’t like the sound of the construction noise. He has a good 50 dB to 55 
dB of noise-induced hearing loss at 28” (Portnuff 2009). 

IPod users vary the volume in accordance with their surroundings — the more 
the background noise, the greater the volume. For example, the use of an iPod on 
the underground system, where the ambient sound is high, encourages users to 
listen at near máximum volume; also, the volume used steadily increases without 
the users’ conscious awareness as they continually attempt to compénsate for the 
potential intrusión of ambient noise. 

Portnuff (2009) found that teenaged iPod users not only played their 
music louder than older users but also were also unaware of how loud they 
were playing it. The playing of music at máximum volume for a mere five minutes 
a day could impair hearing permanently. The European Union’s Scientific 
Committee on Emerging and Newly Identified Health Risks (SCENIHR) published 
its research findings in 2008 and found that that the numbers of young people 
with dangerous levels of noise exposure has tripled in the last twenty years while 
occupational noise levels had decreased in the same period. Although SCENIHR 
found that the majority of MP3 users were not at risk, it also found that 5-10 
percent of users were at high risk due to their patterns and duration of use; these 
users listened to music for more than one hour a day at a high-volume control 
setting. The report concluded that the numbers of EU nationals subject to 
both temporary and permanent hearing loss was on the order of anything up to ten 
million. 

Rawool and Colligon-Wayne (2008) found that general patterns of music 
reception among American youth tended to favor an intense auditory experience — 
that part of the pleasure was precisely the physicality of sonic intoxication. They 
found that “behavioural patterns consistent with a subjective sense of being addicted 
to loud music existed in 9% of their participants” (Rawool and Collington-Wayne 
2008, 5). The intoxicating sound of music pumped directly into the ears constitutes 
a central element of pleasure for many users. The toxic damage to the ears is 
often not recognized or is traded in terms of immediate pleasures verses long-term 
potential damage that for many users remains an abstract possibility. 
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CONCLUSION 


In this chapter I have analyzed the nature of the pleasures of auditory toxicity, 
which goes beyond the proprioceptive into the nature of our social world and the 
communication technologies that citizens habitually use. In doing so I recognize 
that iPod use should not be divorced from a range of other media and communica- 
tion technologies habitually made use of. For example, automobile use, with its 
combination of secessionist cocooning and pleasures of mobility, are incorporated 
and developed through iPod use, which in turn is embedded in historical modes of 
portable radio reception. The nature of use is also embedded in how urban theory 
has understood and explained what living in the city consists of. Ever since Simmel 
argued that individualism in the city was premised upon the neutralization of 
the other through the construction of a “blazé” attitude, in which the “other” was 
effectively neutralized, the trope of urban retreat has featured heavily in urban 
literature (Sennett 1990, 1994). In contrast to this, writers from de Certeau onward 
have offered a more positive evaluation of an open sensory sensibility to urban 
experience that equates that experience with an openness to a wide array of experi- 
ence that focuses upon the recognition of “difference,” upon which self-realization 
is to be understood. Yet, following Walter Benjamin’s assertion that technology has 
trained the human sensorium, the chapter has asked what type of training the use 
of technologies like the iPod signifies. The answer appears to be a complex mix of 
secessionism, creativity, and toxicity. The intense sonic immersion embodied in 
iPod use itself contains elements of both toxicity and creativity. This duality of use 
produces its own paradoxes as evidenced in the following iPod user’s comment: 
“I didn’t realize how much I yearn for control and probably peace and quiet. 
Strange, since Im blasting music in my ears. I think Im really tired of living on 
someone else’s schedule. The iPod has given me some control back” (Janet). 

Janet’s audiotopia is based upon a defensive understanding of the lack of power 
embodied in nonmediated experience. The noise of uncontrolled culture is 
managed through the immersive sounds of her iPod, producing a sense of cognitive 
ease and silence. Thus lies the paradox of toxic audiotopias: Sound produces silence, 
connectivity produces separation, and mediated toxicity produces control. IPods 
are one element of the changing sound matrix of contemporary culture. The 
paradoxical use of Janet and other users needs to be put into a wider social context 
of urban separation and control in which the technology of the iPod is merely one 
more timely technology. 


NOTES 


1 The following empirical examples derive from primary research undertaken in 2005 
and 2006. More than a thousand iPod users filled out a thirty-four-question questionnaire 
over the Internet. The respondents answered requests posted in the New York Times , 
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BBC News Online , Guardian Online , Wired News , and MacWorld. These requests were then 
syndicated and replicated in a wide a variety of newspapers and magazines worldwide. 
Respondents carne mainly from the United States, the UK, Cañada, Australia, and 
Switzerland but also included fewer responses from France, Italy, Spain, Denmark, 

Finland, and Norway. Twenty percent of respondents were then asked follow-up questions 
in response to their initial answers. In addition to the Internet sample, a few UK users 
have also been interviewed face to face. Selected follow-up interviews were conducted 
in 2007. 

2 Teenaged users of mobile phones with MP3 capacity sometimes use their phones 
as mini-ghetto blasters while riding on trains and buses, holding the phone out toward 
others, thus claiming public space in alternative, if not particularly convivial, ways. The 
use of mobile phones in this way is called “sodcasting” in the UK and has produced a 
fervent debate in the pages of the Guardian newspaper concerning the “antisocial” 
nature of sequestering public space with such “treble-heavy” sound ( Guardian , Aug. 13, 
2010, P3). 

3 I take a critical theory perspective on the way in which new technologies such as the 
Apple iPod intégrate the user in new ways into what I refer to as “commodity culture.” 
Dant (2008) takes a “circuit of culture” approach to reach similar conclusions as to the 
structurally integrating nature of use. From this point of view, “individualism” is merely 

a structured ideological response embodied in the cognitive practices of users. Dant also 
points to the power of the Apple brand in integrating the user into a form of “soff 
capitalism.” For more on this see Bull (2008). 
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CHAPTER 23 


THE RECORDING 
THAT NEVER 
WANTED TO BE 
HEARD AND OTHER 

STORIES OF 
SO NI FI CATION 


JONATHAN STERNE AND 
MITCHELL AKIYAMA 


FrOM THE POSTSONIC TO THE SONIFIED 


In the spring of 2008, the incunabula of sound recording briefly broke into 
headline news. 1 A group called First Sounds had played a recording that dated 
back to 1860 (Rosen 2008; Maugh 2008). Anyone with an Internet connection 
could now hear “the world’s oldest recording.” This accomplishment raised a 
number of epistemological and historical questions. What does it mean to 
speak of a sound recording made in 1860? Thomas Edison’s phonograph, fírst 
demonstrated in 1877 and patented in 1878, is still widely considered to be the first 
device capable of recording sound and playing it back (Read and Welch 1976; 
Chanan 1994, 1995). However, First Sounds had succeeded in playing back a record- 
ing from a device that predates Edison’s: the phonautograph. Invented in 1857 by 
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Édouard-Léon Scott de Martinville, the phonautograph was conceived as a “sound 
writer.” It channeled sound through a conic funnel to víbrate a small membrane, 
which in turn vibrated a Stylus, thereby creating visible tracings on a recording 
surface — first a piece of smoked glass and later a sheet of paper rolled around a 
cylinder. Scott called the images it produced “phonautograms,” literally, “speech 
self-writings.” The mechanical similarities between Scott’s phonautograph and 
Thomas Edison’s cylinder phonograph are striking, a point that was not lost 
on nineteenth-century inventors. In the early 1870S Alexander Graham Bell had 
experimented with the phonautograph as a means of teaching deaf children. 
It was a modified versión of the device, employing a human ear as a diaphragm 
that, according to Bell, “gave him the idea for the telephone.” Edison also 
cited the phonautograph as an antecedent to his machines (Gitelman 2006; 
Sterne 2003; Bell 1878; Edison 1888). Yet, for all the similarities between the 
phonograph and the phonautograph, there existed no technology to play back 
phonautograph recordings as sound, at least until sometime shortly before 2008 
(Chanan 1994, 1995; Hankins and Silverman 1995; Lastra 2000; Read and Welch 
1976; Sterne 2003). 

In this chapter we use the story of the phonautograph as a way in and out of 
investigating the development of a cluster of practices called “sonification,” or 
the transformation of nonsonic data into audible sound. The story of the phonau- 
tograph is remarkable because it was conceived as a technology for visualization. Its 
recordings were never supposed to be heard, but now, thanks to the availability of 
tools for transforming the nonsonic into the audible, we are able to play back and 
listen to a recording that was intended to be seen, not heard. To cali this techno- 
logical process “playback” is not quite accurate since, as we will see, a great deal of 
reconstruction went into First Sounds’ efforts. Rather, we consider the possibility of 
listening to a phonautograph recording as marking an aesthetic and epistemic shift 
in the history of sound. This innovation is more than just a technological feat; it 
also speaks to a significant change in the ways in which we construct distinctions 
among the senses. After further introducing the phonautograph, we review a range 
of practices of sonification, all of which are instructive insofar as they move data 
and experience between the sonic and nonsonic registers. Our conclusión then 
returns to the case of the phonoautograph in order to advance some speculative 
propositions regarding the present conjuncture in the history of sound. The rela- 
tionship to sensation central to sonification (and the assemblages of media and 
practices on which it depends) is most distinctively characterized by the ability to 
transform data destined for one sense into data destined for another. This is not a 
claim about synesthesia or, worse, a desire to restore a so-called balance of the 
senses in an idealized sensorium. Rather, we argüe that this extreme plasticity lays 
bare the degree to which the senses themselves are articulated into different 
cultural, technological, and epistemic formations. The implication for sound 
studies is simple to grasp but difficult to handle. If we acknowledge the plasticity of 
sound, we must also acknowledge the limits of a commonsensical definition of 
the field’s object as “what can be heard” or “sound as an effect in the world.” 
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Scholars in the humanities and social Sciences still too often treat particular 
technologies or cultural forms as if they are predestined for or determined by a 
single sense. If we take the proposition of the plasticity of sound seriously, it is no 
longer possible to maintain such assumptions. Recent decades of sensory history, 
anthropology, and cultural studies have rendered banal the argument that the 
senses are constructed. However, as yet, sound scholars have only begun to reckon 
with the implications for the dissolution of our object of study as a given prior to 
our work of analysis. 

Why anchor a chapter on sonification to a visualization technology like the 
phonautograph? Scott’s phonautograph is often accorded a place in the history of 
sound technologies, but it was neither a sound-reproduction technology ñor a 
sonification technology. The phonautograph’s relationship to sound could best be 
described as postsonic. It produced an image after the occurrence of a sound — a 
record of a past sound event that it did not seek to reconstruct sonically. According 
to Scott, the phonautograph was never intended to play back its recordings. Scott 
even derided Edison’s phonograph on the grounds that it “merely reproduced 
sound — it was not a sound-writer” 2 The phonautograph was part of a boom in 
nineteenth-century devices that rendered visible otherwise invisible aspects of the 
natural world. Through a variety of technical feats, these technologies rendered the 
processes of nature — sound, electricity, biological processes and rhythms — as visual 
data that adhered to an orderly pattern. Scott and many of his contemporaries 
believed that to see sound was to better know it. The point of the phonautograph 
was to render sound available to careful visual examination, not to make recordings 
for playback (Hankins and Silverman 1995; Levin 1990). First Sounds 5 intervention 
was to ignore the inventóos intent and instead consciously and deliberately recon- 
struct phonautograph recordings as if they had been made for playback. 

It is a cliché of technological history that invento rs 5 intentions have a way of not 
working out and that whatever purpose or intention is built into a technology can 
change over time. The usual story is one of invention, innovation, engagement with 
users, and reconstruction ad infinitum as technologies achieve moments of tempo- 
rary closure, only to be opened up again and reconfigured from a new, unexpected 
angle (Bijker 1995; lenkins 2006; Pinch and Bijker 1984; Pinch and Oudshoorn 
2003). The story of the phonautograph, however, is a little different. A minor device 
in the history of sound recording, it never really left relative obscurity. Even its most 
ardent users generally moved on to other options as the nineteenth and twentieth 
centuries rolled on (Scripture 1906). The phonautograph appeared only as a curios- 
ity in histories of sound recording (when it was mentioned at all) until it was resur- 
rected by a few media historians at the turn of the twenty-first century (Hankins 
and Silverman 1995; Lastra 2000; Levin 1990; Sterne 2003). Its brief reemergence 
and short-lived posthumous fame signal a shift that occurred in sonic epistemology 
in the intervening years. While Scott and his contemporaries sought to transform 
sounds into images, today groups of scientists, artists, and technologists work to 
convert nonsonic phenomena so that they may be heard and therefore known as 
and through sound. In its time, the phonautograph helped construct sound as an 
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effect that could be reproduced without reproducing its cause. First Sounds’ accom- 
plishment of sonifying a phonautogram reveáis the plasticity of sound, its ability to 
convey data across different sensory registers. In addition, their work reveáis the 
degree to which data are fluid and are not necessarily tethered to any one sense. 

In the spirit of this handbook’s handbookness, we would like to cali attention 
to the usefulness of the concept of articulation for our argument. Drawn from the 
field of cultural studies, articulation theory might best be described as antiessential- 
ism in action (Grossberg 1992; Slack 1996; Hall 1986). Jennifer Daryl Slack and 
J. Macgregor Wise define articulation as “the contingent connection of different 
elements that, when connected in a particular way, form a specific unity.” For them, 
elements can be “made of words, concepts, institutions, practices, and effects, as 
well as material things.” Articulation theory is meant to draw attention to “the 
movement and flows of relationships” (Slack and Wise 2006, 127). So far, one might 
note similarities to Latour’s actor-network theory (Latour 1996) or the social 
construction of technology tradition cited earlier. Inasmuch as all three approaches 
are constructivist, there is a natural affinity. However, articulation theory and the 
cultural studies tradition from which it draws are more particularly attentive 
to questions of power and, for a lack of a better term, the density or cogency of 
articulations. Articulations: 

can and do change over time. But here, too, the speed and direction of change is 
contingent. Some articulations remain relatively tenacious; they are rather firmly 
forged and difficult to disarticulate. . . . Others might be more easily broken and 
thus subject to disarticulation and rearticulation. It all depends on the particulars 
of the nature of articulations at any particular historical moment. (Slack and 
Wise 2006, 128) 

In this chapter we use articulation in two ways: We show how sonification 
articulates a range of practices that render data for the ear, and we explore the ideas 
and practices articulated to hearing within sonification. However, our larger 
historical point is that, at the present moment, the articulations between particular 
senses and kinds of sense data are incredibly weak. If this is the case, it suggests that 
sound scholarship must be ever more vigilant about that shifting border between 
the sonic and the nonsonic. 

Our chapter therefore advances three nested methodological propositions. We 
argüe for attending to the modularity of sensory technologies; for the modularity of 
the relations between senses, subjects, and technologies; and, ultimately, for the 
modularity of the senses themselves. 


Parameters of Sonification 


Alternately known as sonification and auditory display , 3 the practice of turning data 
into sound as yet has no obvious epistemic center, and its overall impact on the arts 
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and Sciences is as yet unclear. Simply put, sonification is the use of nonspeech sound 
to convey information. This involves the translation or transposition of data 
from a given sensory mode into sound — a rearticulation of sorts. Techniques for 
analyzing or presenting data have tended toward the visual at least since the 
Renaissance (Ihde 2009). While there are early examples of attempts to use sound 
as a primary means of monitoring the world — Leonardo da Vinci is credited with 
inventing a form of sonar as early as 1490 — sonification is a relatively recent 
approach to rendering information sensible and has been promoted variously as a 
corrective or a supplement to visual display. 

In her chapter in this volume, Alexandra Supper elegantly renders the shape of 
current debates about the definition of sonification and the policing of the term’s 
borders among people who work in the field. For our chapter we have chosen a 
broad definition of sonification to highlight both the range of practices that might 
be enclosed in its big tent and the connections between sonification and other sonic 
practices. According to Gregory Kramer and Bruce Walker (2004), two pioneers of 
auditory display, sonification falls into three basic categories: alerts and notifica- 
tions, auditory icons, and audification. Alerts are the simplest instruments in the 
sonification toolbox. Fire alarms, sirens, the beeping of a microwave, and so on 
generally signify one thing: There’s a fire; get out of the way, an emergency vehicle 
is trying to pass; your soup is now hot. They are neither able ñor meant to commu- 
nicate anything more subtle than a binary possibility: “The fire alarm signáis that 
there is or is not a fire; not where or how hot it is” (Kramer and Walker 2004, 151). 

Auditory icons — or “earcons” — are capable of more subtle types of communi- 
cation. Associated mainly with computing (Robare and Forlizzi 2009; Gaver 1989; 
Blattner, Sumikawa, and Greenberg 1989), auditory icons might give feedback on a 
process — scrolling through a window or moving a folder to the trash, for example. 
In the late 1980S, William Gaver (1989) developed a sonic interface for Apple called 
SonicFinder. The interface was an ambitious attempt to sonify computing but was 
incorporated into mainstream operating systems only in a limited fashion. Not 
only did Gaver’s System alert the user that a process was occurring, it also employed 
a level of sonic nuance that could communicate subtle features of these operations. 
For example, selecting a file would trigger a “hitting” sound that would vary in 
pitch and timbre depending on its contents or size. Remnants of this approach 
remain buried in the user interfaces of common software applications; in many 
applications, brief blips and sound effects, for example, signal the arrival of an email 
or its successful departure. Microsoft has programmed a veritable cacophony of 
noises into its Office applications, which users can enable or disable as they please. 
We know of no writer who actually uses the many earcons available in MS Word, 
but they are there and available should you wish to be the first. 

While alerts and auditory icons communicate symbolically, audification 
functions isomorphically. Through the modification or the transduction of other 
vibrational or wave phenomena, audification renders data more immediately 
meaningful to a listener. In the early 1960S seismologist Sheridan Speeth (1961) pro- 
posed that amplifying and speeding up recordings of seismic movements could 
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allow listeners to differentiate between earthquakes and explosions caused by bombs. 
Sonocytology — the sonification of cells — operates similarly by amplifying the inau- 
dible vibrations of cellular life. As Sophia Roosth (2009) notes, yeast cells víbrate at 
frequencies within audible range. The volume, however, is far below the threshold 
of human hearing. Using scanning probe microscopes to record the vibration of 
these cells, sonocytologists have managed to sonify yeast cells and listen in on 
experiments. These scientists have found that adding alcohol to the cells 5 médium 
causes them to víbrate more rapidly, a phenomenon that, when sonified, results in 
higher-pitched sound, giving the impression that they are screaming. 

Many recent developments in sonification have happened thanks to computers 
or, more specifically, digitization. Here we confront one of the defining features of 
“new” media as described by Lev Manovich (2001, 45-48): transcoding. Because 
digital files exist as both Computer data and as aesthetic representations of the sen- 
sible world (in the form of an interface), a wide range of possibilities exists for both 
the manipulation of data and the interface itself. When an audio editor displays the 
waveform of a recording, it offers a simple and fairly straightforward kind of 
transcoding. Given audio data, it renders a picture that represents some aspects of 
the audio recording’s waveforms. More Creative options are also possible; for 
instance, an application called Metasynth allows users to actually generate sounds 
from the data found in pictures (http://www.uisoftware.com/MetaSynth/index. 
php). When lasers map the curves of oíd phonographic cylinders or trace the vibra- 
tions of oíd phonautograph tracings, they are similarly performing a kind of 
transcoding. Pace Manovich, transcoding is not necessarily a digital process. The 
study of pulsars in radio astronomy, for example, has for a long time rendered the 
rotation of distant stars as sonic data because the regularity of their rotations can be 
better heard than seen. However, transcoding becomes considerably more common 
in the digital realm because software allows for greater ease and facility in the 
manipulation of data sets. The pulsar example is instructive because it reveáis one 
of the central features of audified knowledge, a feature that works because of 
transcoding — let us cali it “effective indexicality. 55 

Audification requires an assumption about the indexicality of the reproduction 
or manipulation of a phenomenon. The listener must believe that the sound 
produced is, for all intents and purposes, made of the same stuff as the object it 
is meant to represent. While a microwave’s beep is symbolic and arbitrary, audifica- 
tion is indexical in that patterns of sound are directly caused by patterns in the data. 
Here we borrow the distinction between indexes and symbols from Charles Sanders 
Peirce’s semiotic theory. Peirce defined a Symbol as a sign that has an arbitrary 
relationship to its signifier, such as words in language. He defined an Índex as a sign 
with a causal relationship to its signifier — a weathervane, for example (Peirce 1955, 
107; Turino 1999). In audification, patterns in data are made to “cause 55 particular 
sounds to occur, which in turn render those patterns sensible in ways that may 
be harder to discern visually. Although there is an arbitrary dimensión to the 
relationship — someone must choose which sounds to associate with which data — 
the resulting relationship is meant to be experienced as one of cause and effect. 
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Human hands also construct Peirce’s weathervane, and its shape as a rooster or an 
arrow, while arbitrary, does not affect its indexical relationship with the wind. 
Because the indexicality here is a delibérate effect of a human-made representa- 
tional system, we cali it “effective” — it is more important for the audifier to believe 
that the relationship between sound and object is indexical than it is for the 
relationship to actually be indexical. (Some forms of indexicality in Peirce are less 
“human made,” as in the pain caused by a touching a hot stove, though there, too, 
we would note that a certain level of belief in pain as an accurate índex of danger 
is required.) 

Whether it apprehends information through symbolic or indexical representa- 
tion, proponents of audification argüe that the ear is particularly good at certain 
things, superior even to the eye. The ear’s ability to perform monitoring tasks has 
been exploited for decades with devices like the Geiger counter, sonar, and heart 
monitors (Kramer 1994). We can listen, recognizing patterns or anomalies, without 
tiring or even necessarily devoting our full attention to the task at hand. The 
sonification of patterned data — stock market trends, network trafile, vital signs, 
and so on — capitalizes on the ear’s ability to detect anomalies that might “pop out” 
of a continuous stream of information (Kramer 1994; Hermann, Niehus, and Ritter 
2003). The eye isn’t as proficient at picking out discontinuities while the ear is highly 
attuned to “wrong notes.” In contrast to hearing, visión is often described as a 
focused sense, capable of parsing only one thing at a time. The ear, however, can 
discern and parse múltiple sounds simultaneously. Proponents of sonifications 
characterize hearing as a faculty that is more sensitive to direction, a useful strength 
in analyzing spatial data or receiving and acting upon directional cues. Sonification 
discourse also suggests that the ear is more sensitive to affect than the eye (Barrass 
and Kramer 1999; Polli 2003), and, as such, talk about and around sonification 
often partakes of a long-standing romantic ideology that is attached to hearing 
in many cultural fields. The idea that the ear and the sonic arts are somehow 
inherently closer to the seat of emotion and affect is strongly rooted in Western 
traditions that shape the way we think about music, speech, and other forms 
of auditory expression (Flinn 1992, 7, 9; Sterne 2003, 15). We do not need to 
believe in the truth of these assertions, however, to appreciate their effectiveness in 
shaping how practitioners relate to, make use of, and construct arguments about 
sonification. 


Aesthetics of Sonification 


Perhaps it is the romantic idea of sound’s affectivity that has led to growing interest 
in sonification in the art world. While it is true that in most cases the making audi- 
ble of information is largely utilitarian, in many others the lines between scientific 
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and artistic production are blurred. As Alexandra Supper notes (in chapter 10 ), this 
is a major point of concern among some advocates of sonification since the field is 
itself seeking legitimacy among scientists. This attempt at disciplinary respectability 
is complicated by the relative impermeability of the boundaries between art and 
Science. As Supper shows, sonification researchers who are inclined toward inter- 
disciplinary projects often have no choice but to align themselves with the terms, 
language, and expectations of one field or the other. For practitioners whose work 
might be more clearly aligned with the empirical, presenting a project in an artistic 
mode might undermine its credibility. 

There are precedents for the hybridization of the empirical and the expressive. 
Georgina Born and Andrew Barry (2010) develop a genealogy of the emerging field 
they cali “art-science.” They identify three “logics” operating in these fields: account- 
ability, innovation, and ontology (105). First, the aestheticization of data often 
brings a work to the attention of a public that would not necessarily normally have 
access to it. Insofar as an artistic mode of presentation brings researchers’ work into 
a wider public, it is submitted to scrutiny, and researchers are forced to account for 
their findings. Second, the art-science pairing is often touted as producing innova- 
tions through synergy and collaboration. Third, artistic interpretations and presen- 
tations of research often work to cali the very nature of scientific inquiry into 
question. Here we find art and Science seemingly offering each other a chance at 
mutual legitimacy. Science stands to benefit from art insofar as the aesthetic puta- 
tively performs the function of making the empirical accountable (109). On the 
other hand, art practices gain epistemological credibility thanks to their supposedly 
Creative contributions to knowledge (110). 

Since its beginnings as a coherent field, this push-pull between art and Science 
has played out in sonification discourse (Supper 2010). Despite its appeals to scien- 
tific rigor, sonification is an interpretive and aesthetic process. This is true for the 
actual sounds it produces, the reactions of listeners, the interfaces of sonification 
technologies used, and the relationships between sonic and nonsonic data in 
specific sonifications. 

Projects by several individuáis — and in many cases teams — present sonified 
data sets as works of art. Andrea Polli’s (2005) “Atmospherics/Weather Works” 
employs a fifteen-channel sound system to re-create significant storms in the 
New York/Long Island area. Polli and her team mapped several variables — atmo- 
spheric pressure, relative humidity, wind speed, and so on — to an assigned set 
of recordings that included vocals sounds and wind instruments, as well as environ- 
mental and insect sounds. As such, “Atmospherics” doesn’t simply present record- 
ings of the storms; rather, it uses arbitrarily assigned sounds to create an affective 
representation of weather. Polli’s stated intentions make it clear that her work is 
meant to straddle the line between art and Science: 

As an artist sonifying atmospheric data, I am interested in the creation of new 

forms of data interpretation. As individuáis and groups are faced with the need 

to interpret more and more large data sets, a language or series of languages for 
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communicating this mass of data needs to evolve. In my artwork, I have tried to 
develop strategies for the interpretation of data through sound that has both 
narrative and emotional content because I believe that an emotional connection 
with data can increase the human understanding and appreciation of the forces at 
work behind the data. (Polli 2005, 33) 

Other similar examples inelude “Life Music,” a collaboration between 
composer John Dunn and biologist Mary Anne Clark (1999) that sonifies DNA 
sequences by assigning pitches to amino acid sequences; Thierry Delatour’s 
“Molecular Music” (2000) makes audible the vibrational spectra of molecules; 
Navegar é preciso, viver nao é preciso by composer Alberto de Campo and sociologist 
Christian Dayé (2006) sonically represents a variety of socioeconomic data using 
the time/space coordinates of Magellan’s 1519-1522 circumnavigation of the globe. 

The practice of using data as raw material for art has a pedigree that reaches 
back at least as far as the conceptual movements of the 1960S. Sound artists, 
including John Cage, Alvin Lucier, and Charles Dodge, along with installation art- 
ists like Hans Haacke, Sol Lewitt, and Dan Graham, broke away from the prior 
conception of artists as individuáis that realize works according to their own 
uniquely personal visión. One of Cage’s many contributions to the dismantling of 
the romantic notion of the artist as inspired genius was the work Atlas Eclipticalis , 
in which he superimposed music paper on top of star charts and plotted musical 
compositions as though they were constellations. In I Am Sitting in a Room Alvin 
Lucier sonified the acoustic characteristics of a room by recording his speech and 
then playing the recording back into the room, rerecording, playing it back, 
rerecording, and so on until his vocal signature was thoroughly eroded by succes- 
sive generations of mediation. 

Works like these brought natural and artificial systems to the fore by putting a 
process or phenomenon ahead of the artist’s ostensibly personal “visión.” What 
these works also implicitly proposed was that phenomena beyond the perception of 
human senses might somehow be represented or reified. Lev Manovich, writing 
about contemporary visual art that has followed a similar line, suggests: 

If Romantic artists thought of certain phenomena and effeets as un-represantable 
[sic], as something which goes beyond the limits of human senses and reason, 
data visualization artists aim at precisely the opposite: to map such phenomena 
into a representation whose scale is comparable to the scales of human percep- 
tion and cognition. (Manovich 2002) 

As we’ve seen, this move to render data sensible is at the core of sonification. In 
this sense artists and researchers not only are allied in seeking a common objective 
but often also work together and occasionally even take on both roles. These artist/ 
researchers, who often publish in journals like Leonardo or Computer Music Journal , 
have tended to present their work as a fusión of art and Science. The justification 
is that sonification can provide real insight into physical or informational 
processes, as well as serve to create or reinforce a sense of immediacy or relevance. 
The artistic presentation of sonified data is the verso of the phonautogram’s 



THE RECORDING THAT NEVER WANTED TO BE HEARD 


553 


recto. Resolutely experimental in its approach and outcomes, the phonautograph 
occupied an ambiguous place between art and Science. Perhaps this is a central 
feature, more generally, of the process of transcoding from one sense to another. 
Not quite synesthetic, not quite scientific in a classic sense, the conversión of data 
between the senses nevertheless presents itself as having a certain empirical weight 
and is conditioned by the aesthetic sensibilities of inventors, experimenters, 
and users. 


Accessibility 


Sonification discourse has predominantly been concerned with the interpretation 
and display of data, but it has also been widely touted as an important tool for 
creating accessibility for people who are blind. The benefits of sonic alerts, earcons, 
and audified information for those with limited or no visión are obvious, but there 
is one sonification technique that is arguably even more useful: human echoloca- 
tion. Through rigorous training, some blind individuáis have learned to navigate 
the world with remarkable aptitude. Daniel Kish, the most notable proponent of 
human echolocation, lost his sight in his infancy but has gradually learned how to 
negotiate the world acoustically (Kish 2009). By emitting vocal clicks, Kish manages 
to negotiate the world with a grace that is incomprehensible to sighted people. Even 
more astonishing, he is able and has taught others to ride bicycles. He is also able 
to sense and describe objects around him in detail; with training and practice 
echolocation is not only able to reveal the presence and placement of objects but 
can also help a listener to discern size, texture, and density. 

In that echolocation is a technique that uses sound to convey or relay informa- 
tion, it is consistent with the most basic definition of sonification. However, unlike 
other forms, it does not opérate in the symbolic realm, creating representations of 
data or phenomena. Rather, it is a technique for emplacement, for an immediate 
experience of a space. The sounds that the echolocator produces and then receives 
do not represent or tabúlate the world; they present it in its immediate plenitude. 
In the same way that the reflection of light off an object constitutes the unmediated 
experience of seeing, the reflection of sound off a thing constitutes an unmediated 
experience of hearing. This is arguably true of a recent technological extensión 
of human echolocation, FlashSonar (Kish 2009). Developed in part by Kish, 
the device emits clicks at a rate, volume, and endurance beyond the capacity of 
a human. While the sound that the FlashSonar employs has been arbitrarily 
chosen, the “image” of the world that the listener receives is not represented; it is 
immediate. 

In all of these examples of sonification and the discussions it has precipitated, 
one might detect a slight corrective. The argument is that data have historically 
been largely represented by visual means. This should come as no surprise given the 
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inherent visual bias of Western Science (Ihde 2007, 2009). There is a long history in 
both scientific thought and the philosophy of Science of privileging visión as the 
sense most likely to yield scientific knowledge. Vision has not only been favored 
epistemologically but has also been granted metaphorical dominance. The preva- 
lence of light and sight metaphors have been with us at least since the ancient 
Greeks; Don Idhe and others have brought awareness to this visualist legacy in 
scientific thought (Peirce 1955; Foucault and Gordon 1980; Virilio 1989; Jay 1993; 
Hillis 1999; Turino 1999). To be more precise, this bias is not toward visión as such 
but a particular construction of sight as a sense that objectifies and separates subject 
from object. One result is that while there exists a long history of rendering 
sound as visual data — a history coterminous with the rise of modern acoustics and 
hearing Science (Kittler 1999; Ferguson and Cabrera 2008), sonification has only 
recently gained traction as an epistemological innovation in itself. 


Reanimation as Sonification 


Here we return to the First Sounds’ work, for in a way their project inverts the oíd 
visualist bias. If sound could be transcoded into image, it stood to reason that image 
could be transcoded into sound. The phonautograph was designed to turn sound 
into image, not to play it back. Flowever, First Sounds were able to reverse this 
process and thereby render the images on the phonautograms as sound. The group’s 
2008 success in playing back one of Scott’s phonautograms was accomplished using 
technology inspired by new practices in the preservation of oíd phonographic 
cylinders. Because some cylinders are cióse to one hundred years oíd, laying a needle 
on them might potentially damage (if not destroy) them in the act of playback. 
Instead of laying a needle on the grooves of a wax cylinder, newer apparatus use a 
láser to sean the media; Computer software then reconstructs the audio. In the case 
of the phonautogram, a sheet of paper blackened by the smoke of an oil lamp 
and then scratched with a Stylus, the process is similar; a “virtual Stylus” scans the 
document and converts the markings into digital information that can be rendered 
as sound (Hennessey and Giovannoni 2008). 

Again, unlike phonographic cylinders, phonautograms were never supposed to 
produce sound: Scott’s phonautograms were meant to be “read.” What is perhaps 
most significant about First Sounds’ achievement is that they succeeded in 
contravening Scott’s intentions by reading these objeets so that we can hear them. 
This step has been problematic since the process was invented: The patterns that 
phonautograms display do not have any apparent meaning to the untrained viewer. 
This opacity prevented early would-be users from finding much use for the objeets. 
Bell, for example, had hoped to use the device to train deaf students to speak by 
teaching them to modulate their voices until the phonautogram looked like 
one from his own speech. While elegant, the plan was unsuccessful because the 
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phonautograph did not provide the necessary visual detail (Bell 1878). This remained 
the case until it became possible for lasers to sean cylinders. Following this innova- 
tion, the members of First Sounds hypothesized that it might also be possible to 
extract sound from phonautograph recordings (though the idea of hearing pho- 
nautograms probably extends all the way back to their first public presentation). 
They retrieved Scott’s original patent deposits from France and then digitally 
encoded them at Berkeley Labs. Their first few attempts were largely unintelligible; 
because Scott’s phonautograph lacked a governor and had to be cranked by hand, 
the recording speed was erratic and had to be calibrated to be intelligible. First 
Sounds had the arduous task of manually reworking the speed of the recording so 
that it played at a consistent pitch and rate (http://firstsounds.org; Giovannoni to 
Sterne 2008). Sound machines always need a little help from their human auditors, 
but in this case the phonautograph required at least three kinds of assistance — from 
its inventor, from First Sounds, and from us, its listeners. 

First Sounds 5 first success was an 1860 recording of a Au Claire de la Lune. 55 The 
original phonautogram had notations that signaled which part of the line was made 
by which word. First Sounds had known what they were looking for all along; what 
these clues offered was a map. Phonautographic recording couldn’t take advantage 
of modern engineering practice: The machine was not capable of time stamping 
and did not use reference tones or pitches. We have already seen that, from Scott’s 
perspective, phonautograms, though sound recordings in the sense that they were 
sonic vibrations inscribed on paper, were not sound recordings in the sense we 
think of them today. Rather, they were something closer to sound inscriptions, 
fossils, or traces. Scott did not intend for them to be heard or understood in 
sonic terms; rather, they were meant to help observers understand sound through 
visualization. Perhaps the great irony of the device is that it failed as a sound writer 
but, with a good deal of help, ended up being involved in the historical reconstruc- 
tion of sound from 1860. 


LaUGHING WITH THE DeAD 


The admittedly minor career of Scott’s phonautograph and its phonautograms 
illustrates the articulated nature of all sound technologies and sonic practices; 
that is to say, the phonogram is necessarily understood differently in varying for- 
mations. In the formations of nineteenth-century Science — in an epistemological 
paradigm where seeing and knowing were closely coupled — the phonautograph 
was a part of a long line of machines that rendered physical processes visible. In 
this sense, it was almost incidental that the phonautograph was dedicated to the 
transduction of sound since it shared an epistemic relationship with graphical 
devices that traced a whole range of physical processes. In the formation of 
nineteenth-century sound reproduction, the phonautograph stands as part of an 
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important shift toward machines that treated sound as a reproducible effect. In this 
alternative formation, the phonautograph’s ability to transduce sound is essential 
for its historical and cultural legibility. Today, neither of those formations holds, 
and the phonautograph’s means shifts again as a result. The possibility of playing 
back a phonautogram was unimportant to Scott; the device was a means of making 
sound more comprehensible in a paradigm where comprehension often meant 
visibility. However, the possibility of playing back a phonautograph is now impor- 
tant to people for a range of reasons. Sonification offers a utopian proposal similar 
to Scott’s hopes for phonautography — but in reverse. It promises to turn a world of 
information into sound. First Sounds’ work therefore lies somewhere in between 
the audification of nonsonic data and the digital transcoding and reconstruction of 
analog sonic data. In either scenario, Scott’s inten tions for his machine have become 
irrelevant. A phonautogram’s indexicality makes it recognizable when sonified or, 
more accurately, almost recognizable if one is instructed in what to listen for. 

There is an obvious novelty factor in hearing an 1860 recording, but there is 
more to it than that. By way of conclusión, let us step beyond that to consider some 
possible historical, aesthetic, and epistemological implications of First Sound’s 
sonification of phonautograms. As we have argued throughout, sonification is a 
particular articulation of the sonic and the nonsonic, one that points to an increas- 
ing vagueness of the borders around the audible world. Sound studies of the present 
moment must therefore wrestle in new ways with the boundaries of their objects. 
The field must let go of its axiomatic assumptions regarding the givenness of a 
particular domain called “sound,” a process called “hearing,” or a listening subject. 
This is not a cali for a kind of everything-goes postmodernism but rather a reminder 
of the articulatedness of sensory technologies, sense data, and the senses themselves. 
In the rise of sonification, we note an increasingly forceful articulation of the 
senses as permeable and susceptible to transcoding. Thus, we end by wrestling 
with our own reactions to the phonautogram playback. Here are three possible 
interpretations: 

1. The “we are now closer to Scott” interpretation. Most writers on the phonauto- 
graph appear to agree that there is an undeniable morphological similarity between 
Scott’s phonautograph and Edison’s phonograph. Nonetheless, there is an epistemic 
gap with Scott and Bell on one side and Edison on the other because of the latter’s 
privileging of inscription. If this is the case, one could conceivably read our present 
moment as coming around to Scott. 

Thanks to the design of Computer software, we live in an era of unprecedented 
visualization of sound. This is a result not only of sonification practices but also 
of the ways in which software engineers have chosen to display sonic data and to 
construct the sound-mixing and editing process through the design of Computer 
interfaces. To be clear, this is a design decisión and not a necessary consequence of 
digital technology. There are many more occasions to “look at sound” in a digital 
recording studio than in its analog predecessor. Those of us who have mixed audio 
with a mouse instead of a considerably more tactile mixing board know well the 
impulse to set valúes in terms of whole numbers and to draw straight curves in our 
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edits not because they sound right but because they look right. First Sounds’ digital 
sonification of Scott’s phonautogram was similarly calibrated to the medium’s 
visual appearance. 

2. Despite the prevalence of visualization, we are also living in an age of 
unprecedented sonification, which is potentially a reversal ofScotfs process. In some 
important ways, First Sounds’ enterprise demonstrates the degree to which con- 
temporary thinking inverts Scott’s proposition — that the purpose of manipulating 
sound was to transduce it into visible data. Today, anything can be turned into 
sound, and in some cases it is quite useful to do so. Sound reproduction has extended 
into an ever-increasing number of spaces of everyday life (becoming mundane 
in the process), and more and more nonsonic phenomena have come to be 
comprehensible through sound. Friedrich Kittler has suggested that the sound-re- 
production technologies of the late nineteenth century deprivileged the voice. 
Instead, it treated all sounds the same; it submitted them to a logic that dictates that 
“frequencies are frequencies” regardless of their source (Kittler 1999). Sonification 
takes this one step further by converting the frequencies of any given data set into 
an audible spectrum. There is no better example of this than our ability to now 
listen to sound recordings that were never supposed to be reconstituted as sound. 
With the phonautogram, as with the yeast cell and the pulsar, we can now listen to 
that which was never meant to be heard. 

3. There is also a broader conclusión to be made here about the plasticity of data 
in digital schemes and the dissolution ofold knowledge about the senses . In our age, the 
very idea of audition is so different from that of Scott’s era that one must wonder 
whether the sound represen ted by his phonautograms is really the same sound that 
we hear coming out of the speakers when we play it back today. In Scott’s time, 
sound was viewed as a substance, and different sounds were given different valúes. 
Bell and Edison probably still understood sound as having a certain “thisness” to it 
out in the world even though they also understood it as reproducible. For them, 
speech or music (or birdsong or wind) had a certain ontological coherence and 
essential character. In current use, sound-reproduction technology constructs 
sound as nothing more than a stratification of vibration by the body. Those vibra- 
tions within the audible range are considered sound; those outside the audible 
range are interpreted haptically or are not perceived at all. But even the unity of 
“sound” as a perceptual category is an illusion of language. We think of the ear as a 
transducer, as something that changes “sound” into vibrations and then into elec- 
trical and neural signáis that are experienced as sound. While this integration exists 
at the perceptual level, it does not necessarily exist at the biological level. Cognitive 
scientists currently believe that different regions of the brain process different kinds 
of sounds. If there was a gap between Bell and Edison’s understanding of sound and 
Scott’s, one could posit another gap between our sonic world and the sonic world 
in which all three inventors operated. 

If there is discontinuity on one stratum, there is continuity on another. In a 
visceral way, the work of First Sounds pushes the audible history of recording back 
seventeen years from 1877 to 1860. If Scott’s recording is intelligible to listeners 




558 


THE OXFORD H ANDBOOK OF SOUND STUDIES 


as a recording , albeit a bad one, then the oíd Victorian obsession with the voices of 
the dead serenading the living sticks with us even now when the dead sing to us 
every day on classic rock radio stations. We might also tip our hats to Freud’s com- 
ments on the uncanny nature of recording (Freud 1989). When the “Au Claire de la 
Lune” recording was played back on BBC, the news reader couldn’t control her 
laughter as her script moved from technological marvel to obituary (Richards and 
Sherwin 2009). If she could not help laughing upon hearing a dead voice (barely) 
reanimated, perhaps it is because, in the age of sonification, we have the power to 
make anything laugh back. 


NOTES 


1 We would like to thank the editors, Douglas Kahn, and each other for helpful 
suggestions on the chapter. 

2 Édouard-Léon Scott de Martinville, Le Probléme de la parole s’écrivant elle-méme: 

La France , VAmérique (París, 1878), cited in Hankins and Silverman (1995, 137) and in Levin 

(1990). Levin cites Scott in support of his thesis that, just as early cinema was heralded as a 

transparent reproduction of images that would supersede national 

languages, so the prehistory of sound recording articulated an “analogous discourse 

of democratization and univocal, natural signs.” page 36 

3 While the two terms have generally been used interchangeably, Thomas 
Hermann (2008) has recently suggested that “sonification” be used to denote the 
principie of rendering data as sound while “auditory display” should refer to the 
apparatus or technical system. Acknowledging this distinction, we use the term 
sonification as a catch-all. 
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